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1 GACAGAGTGCAGCCTTTTCAGACTCTGTGACACAGTTCCCCTTTT 
46 6CAAAAATACTTAGCGAGGATCATTACTTTCCAACAGTCGT6TCC 
9 1 AGAGACCTACTTTGTAACACCGCAG6GAAGTTAATGTACTA6GTC 
136 TTGAAAGGTCTTTCTGGAATGTGCAGTAACTTGTAGTTTTCTTCT 
181 AGTAGCACTGCTAATTTTTGTGTTATAATTTTTGTAGGTCCATGG 

226 GGCCGATGTATGGGA6ATGAAT6T6GTCCCGGAGGCATCCAAACG 

MetGlyAspGluCysGlyProGlyGlylleGlnThr 

271 AGGGCT6TGTGGTGT6CTCATGT6GAGGGAT66ACTACACTGCAT 
ArgAlaValTrpCy sAlaHi sValGluGlyTrpThrThrLeuHi s 

316 ACTAACTGTAAGCAGGCCGAGA6ACCCAATAACCAGCAGAATTGT 
ThrAsnCysLysGlnAlaGluArgProAsnAsnGlnGlnAsnCys 

361 TTCAAAGTTTGCGATTGGCACAAAGAGTTGTACGACTGGAGACTG 
PheLysValCysAspTrpHisLysGluLeuTyrAspTiT>ArgLeu 

406 GGACCTTGGAATC AGTGTCAGCCCGTGATTTC AAAAAGCCTAGAG 
GlyProTrpAsnGlnCysGlnProVallleSerLysSerLeuGlu 

451 AAACCTCTTGAGTGCATTAAGGGGGAAGAAGGTATTCAGGTGAGG 
Ly sProLeuGluCy s 1 1 eLy sGlyGluGluGly I leGlnValArg 

496 6AGATAGC6T6CATCCAGAAAGACAAAGACATTCCTGCGGA6GAT 
Glul leAlaCys 1 1 eGlnLy sAspLy sAspI leProAlaGluAsp 

54 1 ATCATCTGT6A6TACTTTGAGCCCAAGCCTCTCCTGGAGCAGGCT 
IlelleCysGluTyrPheGluProLysProLeuLeuGluGlnAla 

586 TGCCTCATTCCTTGCCAGCAAGATTGCATCGTGTCT6AATTTTCT 
CysLeuIleProCysGlnGlnAspCysIleValSerGluPheSer 

631 GCCTGGTCCGAATGCTCCAAGACCTGCGGCAGCGGGCTCCAGCAC 
AlaTrpSerGluCysSerLysThrCysGlySerGlyLeuGlnHis 

67 6 CGGACGCGTCATGTGGTGGCGCCCCCGCAGTTCGGAGGCTCTGGC 
ArgThrArgHisValValAlaProProGlnPheGlyGlySerGly 

721 TGTCCAAACCTGACGGAGTTCCAGGTGTGCCAATCCA6TCCATGC 
CysProAsnLeuThrGluPheGlnValCysGlnSerSerProCys 

766 GAGGCC6AGGAGCTCAGGTACAGCCTGCATGT6GGGCCCT66A6C 
GluAlaGluGluLeuArgTyrSerLeuHisValGlyProTrpSer 

811 ACCTGCTCAATGCCCCACTCCCGACAAGTAA6ACAA6CAAGGAGA 
ThrCysSerMetProHisSerArgGlnValArgGlnAlaArgArg 
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856 C6CGG6AA6AATAAA6AAC666AAAA66ACC6CAGCAAA66A6TA 
ArgGlyLysAsnLysGlxiArgGluLysAspArgSerLysGlyVal 

901 AA66ATCCAG2U^GCCC6CGAGCTTATTAAGAAAAA6AGA2VACA6A 
LysAspProGluAlaArgGluLeuIleLysLysLysArgAsnArg 

946 AACA6GCA6AACAGACAAGA6AACAAATATTGGGACATCCAGATT 
AsnArgGlnAsnArgGlnGluAsnLysTyrTrpAspIleGlnlle 

991 GGATATCA6ACCAGAGA6GTTATGTGCATTAACAA6ACGGGGAAA 
GlyTyrGlnThrArgGluValMetCys IleAsnLysThrGlyLys 

1036 GCTGCTGATTTAAGCTTTTGCCAGCAAGAGAAGCTTCC AATGACC 
AlaAlaAspLeuSerPheCysGlnGlnGluLysLeuProMetThr 

1081 TTCCAGTCCT6TGTGATCACCAAAGAGTGCCAGGTTTCCGAGTGG 
PheGlnSerCysVallleThrLysGluCysGlnValSerGluTrp 

112 6 TCA6AGTG6AGCCCCTGCTCAAAAACATGCCATGACAT6GTGTCC 
SerGluTrpSerProCysSerLysThrCysHisAspMetValSer 

1 17 1 CCTGCAGGCACTCGTGTAA6GACACGAACCATCAGGCAGTTTCCC 
ProAlaGlyThrArgValArgThrArgThrlleArgGlnPhePro 

1216 ATT6GCAGTGAAAAGGAGTGTCCAGAATTTGAAGAAAAA6AACCC 
lleGlySerGluLysGluCysProGluPheGluGluLysGluPro 

1261 TGTTTGTCTCAAGGA6ATGGAGTTGTCCCCT6TGCCACGTATGGC 
CysLeuSerGlnGlyAspGlyValValProCysAlaThrTyrGly 

1306 TGGA6AACTACAGAGTGGACTGAGTGCCGTGTGGACCCTTTGCTC 
TrpArgThrThrGluTrpThrGluCysArgValAspProIfeuIteu 

1351 AGTCAGCAGGACAAGAGGCGCGGCAACCAGACGGCCCTCTGTG6A 
SerGlnGlnAspLysArgArgGlyAsnGlnThrAlaLeuCysGly 

1396 GGGGGCATCCA6ACCCGAGAGGTGTACTGCGTGCAGGCCAACGAA 
GlyGlylleGlnThrArgGluValTyrCysValGlnAlaAsnGlu 

1441 AACCTCCTCTCACAATTAAGTACCCACAAGAACAAAGAAGCCTCA 
AsnLeuLeuSerGlnLeuSerThrHisLysAsnLysGluAlaSer 

1486 AAGCCAATGGACTTAAAATTATGCACT6GACCTATCCCTAATACT 
LysProMetAspLeuLysLeuCysThrGlyProIleProAsnThr 

1531 ACACAGCTGTGCCACATTCCTTGTCCAACTGAATGTGAAGTTTCA 
ThxrGlnLeuCysHisIleProCysProThrGluCysGluValSer 
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157 6 CCTTGGTCAGCTTGGGGACCTTGTACTTATGAA2U^CTGTAATGAT 
ProTrpSerAlaTrpGlyProCysThrTyrGlTiAsnCysAsnAsp 



1621 CAGCAAGGGAAAAAAGGCTTCAAACTGAGGAAGCGGCGCATTACC 
GlnGlnGlyLysLysGlyPhelfysLeuArgLysArgArglleThr 

1666 AATGAGCCC ACTGGA6GCTCT6GGGTAACCGGAAACTGCCCTCAC 
AsnGluProThrGlyGlySerGlyValThrGlyAsnCysProHis 

1711 TTACTG6AAGCCATTCCCTGTGAAGAGCCTGCCTGTTATGACTGG 
LeuLeuGluAlalleProCysGluGluProAlaCysTyrAspTrp 

1756 AAAGCGGTGAGACTGG6A6ACTGC6AGCCAGATAACGGAAAGGAG 
LysAlaValArgLeuGlyAspCysGluProAspAsnGlyLysGlu 

1801 TGTGGTCCAGGCAC6CAAGTTCAAGAGGTTGTGTGCATCAACA6T 
CysGlyProGlyThrGlnValGlnGluValValCysIleAsnSer 

1846 GATGGA6AAGAAGTTGAC AGACAGCTGTGCA6AGATGCCATCTTC 
AspGlyGluGluValAspArgGlnLeuCysArgAspAlallePhe 

1891 CCCATCCCTGTGGCCTGT6ATGCCCCATGCCCGAAAGACTGTGTG 
ProIleProValAlaCysAspAlaProCysProLysAspCysVal 

1936 CTCAGC AC ATGGTCTACGTGGTCCTCCTGCTC AC AC ACCTGCTC A 
LeuSerThrTrpSerThrTrpSerSerCysSerHisThrCysSer 

1981 GGGAAAACGACAGAAGG6AAACAGATACGAGCACGATCCATTCTG 
GlyLy sThrThrGluGlyLy sGlnl leArgAlaArgSerl leLeu 

2026 GCCTATGCGGGTGAAGAAGGTGGAATTCGCTGTCCAAATA6CAGT 
AlaTyrAlaGlyGluGluGlyGlylleArgCysProAsnSerSer 

2071 GCTTTGCAA6AAGTAC6AAGCTGTAAT6AGCATCCTTGCACAGTG 
AlaLeuGlnGluValArgSerCysAsnGluHisProCysThrVal 

2116 TACCACTGGCAAACTGGTCCCTGGGGCC AGTGCATTGAG6ACACC 
TyrHisTrpGlnThrGlyProTrpGlyGlnCysIleGluAspThr 

2161 TCAGTATCGTCCTTCAACACAACTACGACTTGGAATGGGGAGGCC 
SerValSerSerPheAsnThxrThrThrThrTrpAsnGlyGluAla 

2206 TCCTGCTCTGTCGGC ATGCAGACAAGAAAAGTCATCTGTGTGCGA 
SerCysSerValGlyMetGlnThrArgLysVallleCysValArg 

2251 GTCAATGTGGGCC AAGTGGGACCCAJUUUU^TGTCCTGAT^GCCTT 
ValAsnValGlyGlnValGlyProLysLysCysProGluSerLeu 
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2296 CGACCTGAAACTGTAAGGCCTTGTCTGCTTCCTTGTAAGAAGGAC 
ArgProGluThrValArgProCysLeuLeuProCysLysLysAsp 

2341 TGTATTGTGACCCCATATAGTGACT6GACATCATGCCCCTCTTC6 
Cy s I 1 eVa iThr ProTyr Ser AspTrpThr SerCy sProSerSer 

2386 TGTAAAGAAGG66ACTCCAGTATCAGGAAGCAGTCTAGGCATCGG 
CysLysGluGlyAspSerSerlleArgLysGlnSerArgHisArg 

2431 GTCATCATTCA6CTGCCAGCCAACGGGGGCCGAGACTGCACA6AT 
ValllelleGlnLeuProAlaAsnGlyGlyArgAspCysThrAsp 

2476 CCCCTCTATGAAGAGAAGGCCTGTGAGGCACCTCAA6CGTGCCAA 
ProLeuTyrGluGluLysAlaCysGluAlaProGlnAlaCysGln 

2521 AGCTACAGGTGGAA6ACTCACAAATGGCGCAGATGCCAATTAGTC 
SerTyrArgTrpLysThrHisLysTirpArgArgCysGlnLeuVal 

2566 CCTTGGAGC6TGCAAC AAGACAGCCCTGGAGC ACAGGAAGGCT6T 
ProTrpSerValGlnGlnAspSerProGlyAlaGlnGluGlyCys 

2611 G6GCCTGGGCGACAGGCAAGA6CCATTACTT6TC6CAAGCAAGAT 
GlyProGlyArgGlnAlaArgAlalleThrCysArgLysGlnAsp 

2656 GGAGGACAGGCTGGAATCCATGAGTGCCTACAGTATGCAGGCCCT 
GlyGlyGlnAlaGlylleHisGluCysLeuGlnTyrAlaGlyPro 

2 70 1 GTGCCAGCCCTTACCCAGGCCTGCCA6ATCCCCTGCCAG6ATGAC 
ValProAlaLeuThrGlnAlaCysGlnlleProCysGlnAspAsp 

2746 TGTCAATTGACC AGCTGGTCC AAGTTTTCTTC ATGCAATGGAGAC 
CysGlnLeuThrSerTrpSerLysPheSerSerCysAsnGlyAsp 

2791 TGTG6TGCAGTTAGGACCAGAAAGCGCACTCTTGTTG6A2VAAAGT 
CysGlyAlaValArgThrArgLysArgThrLeuValGlyLysSer 

2836 AAAAAGAAGGAA/U^TGTAAAAATTCCCATTTGTATCCCCTGATT 
LysLysLysGluLysCysLysAsnSerHisLeuTyrProLeuIle 

2881 GA6ACTCA6TATT6TCCTT6T6ACAAATATAAT6CACAACCTGTG 
GluThrGlnTyrCysProCysAspLysTyrAsnAlaGlnProVal 

2926 GG6AACTG6TCA6ACTGTATTTTACCAGAGGGAAAAGT6GAA6TG 
GlyAsxiTrpSerAspCy s 1 1 eLeuProGluGlyLysValGluVal 

2971 TTGCTGG6AATGAAAGTACAAGGAGACATCAAGGAATGCGGACAA 
LeuLeuGlyMetLysValGlnGlyAspIleLysGluCysGlyGln 
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3016 GGATATCGTTACCAAGCAATGGCATGCTACGATCAAAATGGCA6G 
GlyTyrArgTyrGlnAlaMetAlaCysTyrAspGlnAsnGlyArg 

3061 CTTGTGGAAAC ATCTAGATGTAAC A6CC ATGGTTAC ATTGAGGAG 
LeuValGluThrSerArgCysAsnSerHisGlyTyrlleGluGlu 

3106 GCCTGCATCATCCCCTGCCCCTCAGACTGCAAGCTCAGTGAGTGG 
AlaCysIlelleProCysProSerAspCysLysLeuSerGluTrp 

3151 TCCAACTG6TC6CGCTGCAGCAA6TCCTGTGGGAGTGGTGTGAA6 
SerAsnTrpSerArgCysSerLysSerCysGlySerGlyValLys 

3196 6TTC6TTCTAAATGGCT6CGTGAAAAACCATATAAT6GAGGAA6G 
ValArgSerLysTrpLeuArgGluLysProTyrAsnGlyGlyArg 

3241 CCTTGCCCCAAACTGGACC ATGTCAACCAGGCACAGGTGTAT6AG 
ProCysProLysIieuAspHisValAsnGlnAlaGlnValTyrGlu 

3286 GTTGTCCCATGCCACAGTGACTGCAACCAGTACCTATGGGTCACA 
ValValProCysHisSerAspCysAsnGlnTyrLeuTrpValThr 

3331 GAGCCCT6GAGC ATCTGCAAGGTGACCTTTGTGAATATGCGGGAG 
GluProTrpSerlleCysLysValThrPheValAsnMetArgGlu 

3376 AACTGTG6AGAGGGCGTGCAAACCCGAAAAGTGAGATGCATGCAG 
AsnCysGlyGluGlyValGlnThrArgLysValArgCysMetGln 

3421 AATACAGCAGATGGCCCTTCT6AACAT6TAGAGGATTACCTCT6T 
AsnThrAlaAspGlyProSerGluHisValGluAspTyrLeuCys 

3466 GACCCAGAAGA6ATGCCCCTGGGCTCTAGAGT6TGCAAATTACCA 
AspProGluGluMetProLeuGlySerArgValCysLysLeuPro 

3511 TGCCCTGAGGACTGTGTGATATCTGAATGGGGTCCATGGACCCAA 
CysProGluAspCysVallleSerGluTrpGlyProTrpThrGln 

3556 TGTGTTTTGCCTTGCAATCAAAGCAGTTTCCGGCAAAGGTCAGCT 
CysValLeuProCysAsnGlnSerSerPheArgGlnArgSerAla 

3601 6ATCCCATCAGACAACCAGCTGATGAAGGAAGATCTTGCCCTAAT 
AspProIleArgGlnProAlaAspGluGlyArgSerCysProAsn 

3646 GCTGTTGAGAAAGAACCCTGTAACCTGAACAAAAACTGCTACCAC 
AlaValGluLysGluProCysAsnLeiiAsnLysAsnCysTyrHis 

3691 TATGATTATAAT6TAACAGACTGGAGTACATGTCAGCTGAGTGA6 
TyrAspTyrAsnValThrAspTrpSerThrCysGlnLeuSerGlu 
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3736 AAGGC A6TTTGTGGAAATGGAATAAAAAC AAGGATGTTGGATTGT 
Ly s Al aVa ICy sGlyAsnGly 1 1 eLy sThr ArgMe t LeuAspCy s 

3781 GTTCGAAGTGATGGCAAGTCAGTTGACCTGAAATATTGT6AAGCG 
ValArgSerAspGlyLysSerValAspLeuLysTyrCysGluAla 

3826 CTTGGCTTG6AGAAGAACTGGCA6ATGAACACGTCCTGCATGGTG 
LeuGlyLeuGluLysAsnTrpGlnMetAsnThrSerCysMetVal 

3871 GAATGCCCTGTGAACTGTCAGCTTTCTGATTGGTCTCCTTGGTCA 
GluCysProValAsnCysGlnLeuSerAspTrpSerProTrpSer 

3916 GAATGTTCTCAAACATGTGGCCTCACAGGAAAAATGATCCGAAGA 
GluCysSerGlnThrCysGlyLeuThrGlyLysMetlleArgArg 

3961 CGAACAGTGACCCAGCCCTTTCAAGGTGATGGAAGACCATGCCCT 
ArgThrValThrGlnProPheGlnGlyAspGlyArgProCysPro 

4006 TCCCTGATGGACCAGTCCAAACCCTGCCCAGTGAAGCCTTGTTAT 
SerLeuMetAspGlnSerLysProCysProValLysProCysTyr 

4051 CGGTGGCAATATGGCCAGTGGTCTCCATGCC AAGTGCAG6AGGCC 
ArgTrpGlnTyrGlyGlnTrpSerProCysGlnValGlnGluAla 

4096 CAGTGTGGAGAAGGGACCAGAACAAGGAAC ATTTCTTGTGTAGTA 
GlnCy sGlyGluGlyThrArgThr ArgAsnl 1 eSerCy sValVal 

4 14 1 A6TGATGG6TCAGCTGAT6ATTTCAGCAAAGTGGTGGATGAGGAA 
SerAspGlySerAlaAspAspPheSerLysValValAspGluGlu 

4186 TTCTGTGCT6ACATTGAACTCATTATAGATGGTAATAAAAATATG 
PheCysAlaAspIleGluLeuIlelleAspGlyAsnLysAsnMet 

4231 GTTCTGGAGGAATCCTGCAGCCAGCCTTGCCCAGGTGACTGTTAT 
ValLeuGluGluSerCysSerGlnProCysProGlyAspCysTyr 

427 6 TTGAAGGACTGGTCTTCCTGGAGCCTGTGTCAGCT6ACCTGTGTG 
LeuLysAspTrpSerSerTrpSerLeuCysGlnLeuThrCysVal 

4321 AATGGTGAG6ATCTAGGCTTTGGTG6AATACAGGTCAGATCCAGA 
AsnGlyGliiAspLeuGlyPheGlyGlylleGlnValArgSerArg 

4366 CCGGTGATTATACAAGAACTAGAGAATC AGCATCTGTGCCCAGAG 
ProVal I le I leGlnGluLeuGluAsnGlnHi sif euCy sProGlu 

4411 CAGAT6TTAGAAACAAAATCATGTTAT6ATG6ACAGT6CTATGAA 
GlnMetLeuGluThrLysSerCysTyrAspGlyGlnCysTyrGlu 

4456 TATAAATGGATGGCCAGTGCTTGGAAGGGCTCTTCCCGAACAGTG 
TyrLysTrpMetAlaSerAlaTrpLysGlySerSerArgThrVal 
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TGGTGTCAAAGGTCAGATGGTATAAATGTAACAGGGGGCTGCTTG 
TrpCysGlnArgSerAspGlylleAsnValThrGlyGlyCysLeu 

GTGAT6AGCCAGCCTGATGCCGACA6GTCTTGTAACCCACCGTGT 
ValMetSerGlnProAspAlaAspArgSerCysAsnProProCys 

A6TCAACCCCACTCGTACTGTA6C6AGACA7UUVACATGCCATTGT 
SexrGlnProHisSerTyrCysSerGluThrLysThrCysHisCys 

GAAGAAGGGTACACTGAAGTCATGTCTTCTAACAGCACCCTTGAG 
GluGluGlyTyrThrGluValMetSerSerAsnSerThrLeuGlu 

CAATGCACACTTATCCCCGTGGTGGTATTACCCACCATGGAGGAC 
GlnCysThrLeuIleProValValValLeuProThrMetGluAsp 

AAAAGAGGAGATGTGAAAACCAGTCGGGCTGTACATCCAACCCAA 
LysArgGlyAspValLysThrSerArgAlaValHisProThrGln 

CCCTCCAGTAACCCAGCAGGACGGGGAAGGACCTGGTTTCTACAG 
ProSerSerAsnProAlaGlyArgGlyl^gThrTrpPheLeuGln 

CCATTTGGGCCAGATGGGAGACTAAAGACCTGGGTTTACGGTGTA 
ProPheGlyProAspGlyArgLeuLysThrTrpValTyrGlyVal 

6CAGCTGGGGCATTTGTGTTACTCATCTTTATTGTCTCCATGATT 
AlcU^laGlyAlaPheValLeuLeuIlePhelleValSerMetlle 

TATCTAGCTTGCAAAAAGCCAAA6AAACCCCAAA6AAGGCAAAAC 
TyrLeuAlaCysLysLysProLysLysProGlnArgArgGlnAsn 

AACCGACTGAAACCTTTAACCTTAGCCTATGATGGAGATGCCGAC 
AsnArgLeuLysProLeuThrLeuAlaTyrAspGlyAspAlaAsp 

ATGTAACATATAACTTTTCCTGGCAACAACCAGTTTCGGCTTTCT 
Met 
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5041 GACTTCATAGATGTCCAGAGGCCAC AACAAATGTATCCAAACTGT 

5086 GTGGATTAAAATATATTTTAATTTTTAAAAATGGCATCATAAAGA 

5131 CAAGAGTGAAAATCATACTGCCACTGGAGATATTTAAGACAGTAC 

5176 CACTTATATACAGACCATCAACCGTGAGAATTATAGGAGATTTAG 

5221 CTGAATACATGCTGCATTCTGAAAGTTTTATGTCATCTTTTCTGA 

5266 AATCTACCGACTGAAAAACCACTTTCATCTCTAAAAAATAATGGT 

5311 GGAATTGGCCAGTTA6GATGCCTGATACAAGACCGTCTGCAGT6T 

5356 TAATCCATAAAACTTCCTAGCATGAAGAGTTTCTACC AAGATCTC 

5401 CAC AATACTATGGTCAAATTAACATGTGTACTCAGTTGAATGACA 

5446 CACATTATGTCAGATTATGTACTTGCTAATAAGCAATTTTAACAA 

5491 TGCATAACAAATAAACTCTAAGCTAAGCAGAAAATCCACT6AATA 

5536 AATTCAGCATCTTGGTGGTCGATGGTAGATTTTATTGACCTGCAT 

5581 TTCAGAGACAAAGCCTCTTTTTTAAGACTTCTTGTCTCTCTCCAA 

5626 AGTAAGAATGCTGGACAAGTACTAGTGTCTTAGAAGAACGAGTCC 

5671 TCAAGTTCAGTATTTTATAGTGGTAATTGTCTGGAAAACTAATTT 

5716 ACTTGTGTTAATACAATACGTTTCTACTTTCCCTGATTTTCAAAC 

5761 TGGTTGCCTGCATCTTTTTTGCTATATGGAAGGCACATTTTTGCA 

5806 CTATATTAGTGCAGCACGATAGGCGCTTAACCAGTATTGCC ATA6 

5851 AAACTGCCTCTTTTCATGTGGGATGAAGACATCTGTGCCAAGAGT 

5896 GGC ATGAAGACATTTGCAAGTTCTTGTATCCTGAAGAGA6TAAAG 

5941 TTCAGTTTGGATGGCAGCAAGATGAAATCAGCTATTACACCTGCT 

5986 GTAC ACACACTTCCTC ATC ACTGCAGCC ATTGTGAAATTGACAAC 

6031 ATGGCGGTAATTTAAGTGTTGAAGTCCCTAACCCCTTAACCCTCT 

6076 AAAAGGTGGATTCCTCTAGTTGGTTTGTAATTGTTCTTTGAAGGC 

6121 TGTTTATGACTAGATTTTTATATTT6TTATCTTTGTTAAGAAAAA 

6166 AAAAAGAAAAAGGAACTGGATGTCTTTTTAATTTTGAGCAGATGG 

6211 AGAAAATAAATAATGTATCAATGACCTTTGTAACTAAAGGAAAAA 

6256 AAlUyUULAATGTGGATTTTCCTTTCTCTCTGATTTCCCAGTTTCA 

6301 GATT6AAT6TCT6TCTTGCAG6CA6TTATTTCAA2UITCCATA6TC 

6346 TTTNGCCTTTCTCACTGGCAAAATTTGA 
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1 CACCCCTCTGCCTGCCCCAGCCCGCCCATCGCTTCCCCTTTGGAG 

4 6 CCTCCTGCT6GGCCACTGGCTGGGATCAGGACACCAGTGATG6TA 

9 1 AGTGCTGGCCCAGACTGAAGCTCGGAGAGGCACTCTGCTTGCCCA 

136 GCGTCACAGTCTTAGCTCCCAACTGTCCTGGCTTCCAGTCTCCCT 

181 TGCTTCCCA6ATCCCAGACTCTAGCCCCAGCCCCGTCTCTTTCAC 

226 CAGCTCCTGGGACCCTACGCAATCTGCGCCTGCGTCTC ATCAGTC 

271 GCCCCACATGTAACT6TATCTACAACC AGCT6CACCAGCGACACC 

316 TGTCCAACCCGGCCCGGCCTGGGATGCTATGTGGGGGCCCCCAGC 

361 CTGGGGTGCAGGGCCCCTGTCAGGTCTGATAGGGAGAAGAGAAGG 

406 AGCAGAAGGGGAGGGGCCTAACCCTGGGCTGGGGGTTGGACTCAC 

451 AGGACTGGGGGAAAGAGCTGCAATCAGAGGGTGTCTGCCATAGCT 

496 GGGCTCAGGCATCTGTCCTTGGCTTTGTTGCCTGGCTCCAGGGAG 

541 ATTCCGG6GGCCCTGTGCT6TGCCTCGAGCCT6ACGGACACTGGG 

586 TTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGG 

631 ACGCTCCTGTGCTGCTGACCAACACAGCTGCTCACAGTTCCTGGC 

676 T6CAGGCTC6AGTTCAGGGGGCAGCTTTCCTGGCCCA6AGCCCA6 



721 AGACCCC6GAGATGA6TGATGAGGACAGCTGT6TAGCCT6T6GAT 

MetSerAspGluAspSerCysValAlaCysGlyS 

766 CCTTGAGGACAGCAGGTCCCCAGGCA6GAGCACCCTCCCCATGGC 
erLeuArgThrAlaGlyProGlnAlaGlyAlaProSerProTrpP 

811 CCTG6GAGGCCAGGCTGATGCACCAGGGACAGCTGGCCT6TGGCG 
roTrpGluAlaArgLeuMetHisGlnGlyGlnLeTiAlaCysGlyG 

856 GAGCCCTGGTGTCAGAGGA6GCGGTGCTAACT6CTGCCCACTGCT 
lyAlaLeuValSerGluGluAlaValLeuThrAlaAlaHisCysP 

901 TCAATGGGC6CCAGGCCCCAGAG6AATG6AGC6TAGGGCTGGGGA 
heAsnGlyArgGlnAlaProGluGluTrpSerValGlyLeuGlyT 

946 CCAGACCGGAGGAGTGGGGCCTGAAGCA6CTCATCCTGCAT6GAG 
hrArgProGluGluTrpGlyLeuLysGlnLeuIleLeuHisGlyA 

991 CCTACACCCACCCTGAGGGGGGCTAC6ACATGGCCCTCCTGCT6C 
laTyrThrHisProGluGlyGlyTyrAspMetAlaLeuLeuLetiL 

1036 TGGCTCAGCCTGTGACACTGGGAGCCAGCCTGCGGGCCCTCT6CC 
euAlaGlnProValThrlieuGlyAlaSerLeuArgAlaLeuCysIf 

1081 TGCCCTATTTT6ACCACCACCTGCCTGATGGGGAGCGTGGCTGGG 
euProTyrPheAspHi sHi sLeuProAspGlyGltiArgGlyTrpV 

112 6 TTCTGGGACGGGCCCGCCCAGGAGCAGGCATCAGCTCCCTCCA6A 
alLeuGlyArgAlaArgProGlyAlaGlylleSerSerLeuGlnT 



117 1 CAGTGCCC6TGACCCTCCTGGGGCCTAGGGCCT6CAGCCGGCTGC 
hrValProValThrLeuLeuGlyProArgAlaCysSerArgLeuH 
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1216 AT6CAGCTCCTGGGG6T6iVT6GCAGCCCTATTCTGCC66G6AT66 
IsAlaAlaProGlyGlyAspGlySerProIleLeuProGlyMetV 

1261 TGT6TACCA6TGCT6TGG6T6AGCTGCCCA6CT6TGA6GGCCT6T 
alCysThrSerAlaValGlyGluLeuProSerCysGluGlyLeuS 

1306 CT6GGGCACCACTGGTGCATGAG6TGAG6GGCACATGGTTCCTGG 
erGlyAlaProLeuValHisGluValArgGlyThrTrpPheLeuA 

1351 CCGGGCTGCACAGCTTCGGAGATGCTTGCCAAGGCCCCGCCAGGC 
laGlyLeuHisSerPheGlyAspAlaCysGlnGlyProAlaArgP 

1396 CGGCGGTCTTCACC6CGCTCCCTGCCTATGAG6ACTGGGTCAGCA 
roAlaValPheThrAlaLeuProAlaTyrGlviAspTrpValSerS 

1441 GTTTGGACTGGCAGGTCTACTTC6CCGAG6AACCAGAGCCC6AGG 
. erLeuAspTrpGlnValTyrPheAlaGluGluProGluProGluA 

1486 CTGAGCCTGGAAGCTGCCTGGCCAACATAAGCCAACCAACCAGCT 
laGluProGlySerCysLeuAlaAsnlleSerGlnProThrSerC 

1531 GCTGAC AGGGGACCTGGCCATTCTCAGGACAAGAGAATGCA6GC A 

ys 

1576 GGCAAATGGCATTACTGCCCCT6TCCTCCCCACCCT6TCATGT6T 

1621 GATTCCAGGCACCAGGGCAGGCCCAGAAGCCCAGCAGCTGTGGGA 

1666 AG6AACCTGCCT6GGGCCACAGGT6CCCCCTCCCCACCCTGCA6G 

17 1 1 ACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAG 

1756 CAGGCGTCTCAGCTTTCCTCCTCCTTTACCCTTTCAGATACAATC 

1801 ACGCCAGCCCCGTTGTTTT6AAAATTTCTTTTTTTGGGGGGCAGC 

1846 AGTTTTCCTTTTTTTAAACTTAAATAAATTGTTACAAAATAGACT 

1891 TTAG 
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1 GCGGATCCTCACACGACTGTGATCCGATTCTTTCCAGCGGCTTCT 
4 6 GCAACCAAGCGGGTCTTACCCCCGGTCCTCCGCGTCTCCAGTCCT 
9 1 CGC ACCTGGAACCCCAACGTCCCCGAGAGTCCCCGAATCCCCGCT 



13 6 CCCAGGCTACCTAAGAG6AT6AGCGGTGCTCC6ACGGCCGGGGCA 

HetSerGlyAlaProThrAlaGlyAla 

181 GCCCTGAT6CTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTCAG 
AlaLeuMetLeuCysAlaAlaThrAlaValLeuLeuSerAlaGln 

226 GGC66ACCC6TGCA6TCCAA6TCGCC6CGCTTTGCGTCCT666AC 
GlyGlyProValGlnSerLysSerProArgPheAlaSerTrpAsp 

271 GAGATGAATGTCCTGGCGCACGGACTCCTGCAGCTCGGCCAGGGG 
GluMetAsnValLeuAlaHisGlyLeuLeuGlxiLeuGlyGlnGly 

316 TGCGC6AACACCGGAGCGCACCCGC A6TCAGCTGAGCGCGCTGGA 
CysAlaAsnThrGlyAlaHisProGlnSerAlaGluArgAlaGly 

361 GCGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGG 
AlaArgLeuSerAlaCysGlySerAlaCysGlnGlyThrGluGly 

406 TCCACCGACCTCCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGAG 
SerThrAspLeuProLeuAlaProGluSerArgValAspProGlu 

451 6TCCTTCACAGCCTGCAGACACAACTCAAGGCTCAGAACAGCAGG 
ValLeuHisSerLeuGlnThrGlnLeuLysAlaGlnAsnSerArg 

496 ATCCAGCAACTCTTCCACAAG6TGGCCCAGCAGCAGCGGCACCT6 
IleGlnGlnLeuPheHisLysValAlaGlnGlnGlnArgHisLeu 

541 GAGAAGCAGCACCTGC6AATTCAGC ATCTGCAAAGCCAGTTT6GC 
GluLysGlnHisLeuJ^glleGlnHisLeuGlnSerGlnPheGly 

586 CTCCTGGACCACAAGCACCTAGACCATGAGGTGGCCAAGCCTGCC 
LeuLeuAspHisLysHisLeuAspHisGluValAlaLysProAla 

631 CGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCCGGCT 
ArgArgLysArgLeuProGlxiMetAlaGlnProValAspProAla 

676 CACAATGTCAGCCGCCTGCACCGGCTGCCCAGGGATTGCCAGGAG 
HisAsnValSerArgLeuHisArgLeuProArgAspCysGlnGlu 

721 CTGTTCCAGGTTGGGGAGAGGCAGAGTG6ACTATTT6AAATCCAG 
LeuPheGlnValGlyGluArgGlnSerGlyLeuPheGluIleGln 

766 CCTCAGGGGTCTCCGCCATTTTTGGTGAACTGCAAGAT6ACCTCA 
ProGlnGlySerProProPheLeuValAsnCysLysMetThrSer 
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811 GATGGAGGCTGGAC AGTAATTCA6AGGCGCCACGATGGCTCAGTG 
AspGlyGlyTrpThrVal IleGlnArgAr gHi s AspGlySerVal 

856 GACTTCAACCGGCCCTGGGAAGCCTACAAGGCG6GGTTTGGGGAT 
AspPheAsxiArgProTrpGluAlaTyrLysAlaGlyPheGlyAsp 

901 CCCCACGGC6A6TTCTGGCTGGGTCTG6AGAA66TGCATAGCATG 
ProHi sGlyGluPheTrpLeuGlyLeuGluLy sValHi sSerMe t 

946 ATGGGGGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACTGG 
MetGlyAspArgAsnSerArgLeuAlaValGlnLeuArgAspTrp 

991 GATGGCAACGCCGAGTTGCTGC AGTTCTCCGTGC ACCTGGGTGGC 
AspGlyAsnAlaGluLeuLeuGlnPheSerValHisLeuGlyGly 

1036 GAGGACACGGCCTATAGCCTGCAGCTCACTGCACCC6TGGCCGGC 
GlxiAspThrAlaTyrSerLeuGlnLeuThrAlaProValAlaGly 

1081 CAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGTACCC 
GlnLeuGlyAlaThrThrValProProSerGlyLeuSerValPro 

1126 TTCTCCACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGAAC 
PheSerThrTrpAspGlnAspHisAspLeuArgArgAspLysAsn 

1171 TGCGCCAAGAGCCTCTCTGGAGGCTGGTG6TTTGGCACCTGCAGC 
CysAlaLysSerLeuSerGlyGlyTrpTrpPheGlyThrCysSer 

1216 CATTCCAACCTCAACGGCCAGTACTTCCGCTCCATCCCACAGCAG 
HisSerAsnLeuAsnGlyGlnTyrPheArgSerlleProGlnGln 

1261 CGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGC 
ArgGlnLysLeuLysLysGlyllePheTrpLysThrTxpArgGly 

1306 CGCTACTACCCGCTGCAGGCCACCACCATGTTGATCCAGCCCATG 
ArgTyrTyrProLeuGlnAlaThrThrMetLeuIleGlnProMet 

1351 GCAGCAGA6GCAGCCTCCTAGC6TCCTGGCTGGGCCTGGTCCCAG 
AlaAlaGluAlaAlaSer 



1396 GCCCAC6AAAGAC6GTGACTCTTGGCTCTGCCC6AGGATGTGGCC 

1441 GTTCCCTGCCTGGGCAGGGGCTCCAAGGAGGGGCCATCTGGAAAC 

I486 TT6TGGACAGA6AAGAA6ACCACGACTGGA6AAGCCCCCTTTCTG 

1531 AGTGCAGGG6GGCTGCATGC6TTGCCTCCTGAGATC6AGGCTGCA 

1576 GGATATGCTCA6ACTCTAGAGGCGTGGACCAAGGGGCATGGAGCT 

1621 TCACTCCTTGCTGGCCAGG6AGTTGGG6ACTCAGAGGGACCACTT 

1666 GGGGCCAGCC AGACTGGCCTCAATGGCGGACTC AGTCAC ATT6AC 

1711 T6ACGGGGACCAGGGCTTGTGTGG6TCGAGAGCGCCCTCATGGTG 

1756 CTGGTGCT6TTGTGTGTAGGTCCCCTGGGGACACAAGCAGGCGCC 

1801 AATGGTATCTGGGCGGAGCTCACA6AGTTCTTGGAATAAAAGCAA 

1846 CCTCAGAACA 
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1 GGTAGCCGACGCGCCGGCCGGCGCGTGACCTTGCCCCTCTTGCTC 

46 GCCTTGAAIUITGGAAAAGATGCTCGCAGGCTGCTTTCTGCTGATC 

MetGluLysMetLeuAlaGlyCysPheLeuLeuIle 

9 1 CTCGGACAGATCGTCCTCCTCCCTGCCGAGGCCAGGGAGCGGTCA 
LeuGlyGlnlleValLeuLeuProAlaGluAlaArgGluArgSer 

13 6 CGTGGGAGGTCCATCTCTAG6GGCAGACACGCTCG6ACCCACCCG 
ArgGlyArgSerl leSerArgGlyArgHi sAlaArgThrHi sPro 

181 CAGACGGCCCTTCT6GAGAGTTCCTGTGAGAACAAGCGGGCAGAC 
GlnThrAlaLeuLeuGluSerSerCysGluAsnLysArgAlaAsp 

226 CTGGTTTTCATCATTGACAGCTCTC6CAGTGTCAACACCCATGAC 
LeuValPhellelleAspSerSerArgSerValAsnThrHisAsp 

271 TATGCAAAG6TCAAGGAGTTCATCGTGGACATCTTGCAATTCTTG 
TyrAlaLysValLysGluPhelleValAspIleLeuGlnPheLeu 

3 16 GACATTG6TCCTGATGTCACCCGAGTGGGCCTGCTCCAATATGGC 
AspIleGlyProAspValThrArgValGlyLeuLeuGlnTyrGly 

3 61 AGCACTGTCAAGAATGAGTTCTCCCTCAA6ACCTTCAAGA6GAAG 
SerThrValLysAsnGluPheSerLeuLysThrPheLysArgLys 

406 TCCGAGGTGGA6CGTGCT6TCAAGAG6AT6CGGCATCTGTCCACG 
SerGluValGluArgAlaValLysArgMetArgHisLeuSexThr 

451 GGCACCATGACTGGGCTGGCCATCCA6TATGCCCTGAACATC6CA 
GlyThrMe tThrGlyLeuAlal leGlnTyrAlaLeuAsnl leAla 

496 TTCTCAGAAGC AGAGGGGGCCCGGCCCCTGAGGGAGAATGTGCCA 
PheSerGluAlaGluGlyAlaArgProLexiArgGliiAsnValPro 

541 CGGGTCATAATGATCGTGACGGATGGGAGACCTCAGGACTCCGTG 
ArgVallleMetlleValThrAspGlyArgProGlnAspSerVal 

586 GCCGAGGTGGCTGCTAAGGCACGGGACACGGGCATCCTAATCTTT 
AlaGluValAlaAlaLysAlaArgAspThrGlyl leLeuI lePhe 

631 GCCATTGGTGTGGGCCAG6TAGACTTCAACACCTTGAAGTCCATT 
AlalleGlyValGlyGlnValAspPheAsnThrLeuLysSerlle 

67 6 GGGAGTGAGCCCCATGAGGACCATGTCTTCCTTGTGGCCAATTTC 
GlySerGluProHi sGluAspHi sValPheLeuVal AlaAsnPhe 

721 AGCCAGATTGAGACGCT6ACCTCCGT6TTCCAGAAGAAGTTGT6C 
SerGlnlleGluTlirlieuThrSerValPheGlnLysLysLeuCys 

Fig. 4 
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766 ACGGCCCACATGTGCAGCACCCTGGAGCATAACTGTGCCCACTTC 
ThrAlaHisMetCysSerThrLeuGluHisAsnCysAlaHisPhe 

811 TGCATC AAC ATCCCTGGCTCATACGTCTGC AGGTGCAAAC AAGGC 
CysIleAsnlleProGlySerTyrValCysArgCysLysGlnGly 

856 TAG ATTCTC AACTCGGATC AGACGACTTGCAGAATCCAGGATCTG 
TyrlleLeuAsnSerAspGlnThrThrCysArglleGlnAspLeu 

901 TGTGCCATGGAGGACCACAACTGTGAGCAGCTCTGTGTGAATGTG 
CysAlaMetGluAspHisAsnCysGluGlnLeuCysValAsnVal 

946 CCGGGCTCCTTCGTCTGCGAGTGCTACAGTGGCTACGCCCTGGCT 
ProGlySerPheValCy sGluCy sTyr SerGlyTyr Al aLeuAla 

991 GAGGATGGGAAGAGGTGTGTGGCTGTGGACTACTGTGCCTCAGAA 
GlxiAspGlyLysArgCysValAlaValAspTyrCysAlaSerGlu 

1036 AACCACGGATGTGAACATGAGTGTGTAAATGCTGATGGCTCCTAC 
AsnHisGlyCysGluHisGluCysValAsnAlaAspGlySerTyr 

1081 CTTTGCCAGTGCCATGAAGGATTTGCTCTTAACCCAGATGAAAAA 
LeuCysGlnCysHisGluGlyPheAlaLexiAsnProAspGluLys 

112 6 ACGTGCACAAAGATAGACTACTGTGCCTCATCTAATCATG6ATGT 
ThrCysThrLysIleAspTyrCysAlaSerSerAsnHisGlyCys 

117 1 CAGTAC6AGTGTGTTAACACA6ATGATTCCTATTCCTGCCACTGC 
GlnTyrGluCysValAsnThrAspAspSerTyrSerCysHisCys 

1216 CTGAAAGGCTTTACCCTGAATCCAGATAA6AAAACCTGCAGAAGG 
LeuLysGlyPheThrLeuAsnProAspLysLysThrCysArgArg 

1261 ATCAACTACTGTGCACTGAACAAACCGGGCTGTGAGCATGAGTGC 
I leAsnTyrCy sAl aLeixAsnLy sProGlyCy sGluHi sGluCy s 

1306 GTCAACATGGAGGAGAGCTACTACTGCCGCTGCCACCGTGGCTAC 
ValAsnMetGluGluSerTyrTya^^^ysArgCysHisArgGlyTyr 

1351 ACTCTGGACCCCAATGGCAAACCCTGCAGCCGAGTGGACCACTGT 
ThrLeuAspProAsnGlyLysProCysSerArgValAspHisCys 

1396 GCACAGCAGGACCATGGCTGTGAGCAGCTGTGTCTGAACAC6GAG 
AlaGlnGlnAspHisGlyCysGluGlnlfeuCysLeuAsnThrGlu 

144 1 GATTCCTTCGTCTGCCAGTGCTCAGAAGGCTTCCTCATCAAC6AG 
AspSerPheValCysGlnCysSerGluGlyPheLeuIleAsnGlu 
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1531 CAT6GTTGTGAATACTCCTGTGTCAACATGGACAGATCCTTTGCC 
HisGlyCysGluTyrSerCysValAsnMetAspArgSerPheAla 



1576 TGTCAGTGTCCTGAGGGACACGTGCTCCGCAGCGATGGGAAGACG 
CysGlnCysProGluGlyHisValLeiiArgSerAspGlyLysThr 

1621 TGT6CAAAATT6GACTCTT6TGCTCT6G6GGACCAC6GTT6TGAA 
CysAlaLysLeuAspSerCysAlaLeuGlyAspHisGlyCysGlu 

1666 CATTCGTGTGTAAGCAGT6AAGATTCGTTTGTGTGCCAGTGCTTT 
HlsSerCysValSerSerGluAspSerPheValCysGlnCysPhe 

17 11 GAAGGTTATATACTCCGTGAAGATGGAAAAACCTGCAGAAGGAAA 
GluGlyTyrlleLeuArgGluAspGlyLysThrCysArgArgLys 

1756 GATGTCTGCCAAGCTATAGACCATGGCTGTGAACACATTTGTGTG 
AspValCy sGlxiAl al leAspHl sGlyCy sGluHi s II eCy sVal 

1801 AACAGTGACGACTCATACACGTGCGAGTGCTTGGAGGGATTCCGG 
AsnSerAspAspSerTyr^hrCysGluCysLeuGluGlyPheArg 

1846 CTCACTGAGGATGGGAAACGCTGCCGAATTTCCTCAGGGAA6GAT 
LeuThrGluAspGlyLysArgCysArglleSerSerGlyLysAsp 

1891 GTCTGC AAATCAACCC ACCATGGCTGC6AACACATTTGTGTTAAT 
ValCy sLy s SerThrHisHi sGlyCy sGluHi s IleCysValAsn 

1936 AATGGGAATTCCTACATCTGCAAATGCTCAGAGGGATTTGTTCTA 
AsnGlyAsnSerTyrlleCysLysCysSerGluGlyPheValLeu 

1981 GCTGAGGACGGAAGACGGTGCAAGAAATGCACTGAA6GCCCAATT 
AlaGliiAspGlyArgArgCysLysLysCysThrGluGlyProIle 

2026 GACCTGGTCTTT6TGATCGATGGATCCAA6AGTCTTGGAGAAGAG 
AspLeuValPheVallleAspGlySerLysSerLeuGlyGluGlu 

2071 AATTTTGAGGTCGTGAAGCAGTTTGTCACTGGAATTATAGATTCC 
AsnPheGluValValLysGlnPheValThrGlyllel leAspSer 

2116 TTGACAATTTCCCCCAAAGCCGCTC6AGTGGGGCTGCTCCAGTAT 
LeuThrlleSerProLysAlaAlaArgValGlyLeuLeuGlnTyr 

2161 TCC AC ACAGGTCCAC ACAGAGTTCACTCTGAGAAACTTCAACTCA 
SerThrGlnValHisThrGluPheThrlteuArgAsnPheAsnSer 

2206 GCCAAAGACATGAAAAAAGCCGTGGCCCACATGAAATACATGGGA 
AlaLysAspMetLysLysAlaValAlaHisMetLysTyrMetGly 
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2251 AAG6GCTCTATGACTGGGCTGGCCCTGAAACACATGTTTGAGAGA 
LysGlySerMetThrGlyLeuAlaLeuLysHisMetPheGluArg 

2296 A6TTTTACCCAAG6AGAAGGG6CCA6GCCCCTTTTCCACAAGG6T 
SerPheThrGlnGlyGluGlyAlaArgProLeuPheHisLysGly 

2341 GCCCAGAGCAGCCATTGTGTTC ACCGACGGACGGGCTCAGGATGA 
AlaGlnSerSerHi sCy sValHi s ArgArgThrGlySerGly 

2386 CGTCTCC6A6TGGGCCAGTAAAGCCAAGGCCAATGGTATCACTAT 

2431 6TATGCT6TT66G6TAGGAAAAGCCATTGAGGAGGAACTACAAGA 

2476 GATTGCCTCTGAGCCCACAAACAAGCATCTCTTCTATGCCGAAGA 

2521 CTTCAGCACAATGGATGAGATAAGTGAAATyilCTCAAGAAAGGCAT 

2566 CTGTGAAGCTCTAGAAGACTCCGATGGAAGACAGGACTCTCCAGC 

2611 AGGG6AACT6CCAAAAAC66TCCAACAGCCAACAGAATCTGAGCC 

2656 AGTCACCATAAATATCCAAGACCTACTTTCCTGTTCTAATTTTGC 

2701 AGTGCAACAC AGATATCTGTTTGAAGAAGACAATCTTTTACGGTC 

2746 TACACAAAAGCTTTCCCATTCAACAAAACCTTCAGGAAGCCCTTT 

2791 G6AAGAAAAACACGATC AATGCAAATGTGAAAACCTTATAATGTT 

2836 CCAGAACCTTGC AAACGAAGAAGTAAGAAAATTTACACAGCGCTT 

2881 AGAAGAAATGACACA6AGAATGGAAGCCCTGGAAAATCGCCTGA6 

2926 ATACA6ATGAAGATTA6AAATCGCGACACATTT6TA6TCATTGTA 

2971 TCACGGATTACAATGAACGCA6TGCAGAGCCCCAAAGCTCAGGCT 

3016 ATTGTTAAATC 
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GGTAGCCGACGCGCCGGCCGGCGCGT6ACCTTGCCCCTCTTGCTC 

GCCTTGAAAATGGAAAAGATGCTCGCAGGCTGCTTTCTGCTGATC 
MetGluLysMetLeuAlaGlyCysPheLeuLeuXle 

CTCGGACAGATC6TCCTCCTCCCCT6CGAGGCCAGG6AGCG6TCA 
LeuGlyGlnlleValLeuLeuProCysGluAlaArgGluArgSer 

C6TGGGAGGTCCATCTCTAGG66CAGACACGCTCGGACCCACCCG 
ArgGlyArgSerlleSerArgGlyArgHisAlaArgThrHisPro 

CAGACGGCCCTTCTGGAGAGTTCCTGTGAGAACAAGCGGGCAGAC 
GlnThrAlaLeuLeuGluSerSerCysGluAsnLysArgAlaASp 

CTGGTTTTCATCATT6ACAGCTCTCGCAGTGTCAACACCCATGAC 
LeuValPhel 1 e 1 1 eAspSer Ser ArgSerVal AsnThrHi s Asp 

TATGCAAAGGTCAAGGAGTTCATCGTGGACATCTTGCAATTCTTG 
TyrAlaLysValLysGluPhelleValAspIleLeuGlnPhelieu 

GACATTGGTCCTGATGTCACCC6AGTGGGCCTGCTCCAATATGGC 
AspIleGlyProAspValThrArgValGlyLeuLeuGlnTyrGly 

AGCACTGTCAAG2UIT6AGTTCTCCCTCAAGACCTTCAAGAGG2VAG 
SerThrValLysAsnGluPheSerLeuLysThrPheLysArgLys 

TCCGAGGTGGAGCGTGCTGTCAAGAGGATGC6GCATCTGTCCACG 
SerGluValGluArgAlaValLysArgMetArgHisLeuSerThr 

GGCACCATGACTGGGCTGGCCATCCAGTATGCCCTGAACATCGCA 
GlyThrMetThrGlyLeuAlalleGlnTyrAlaLeuAsnlleAla 

TTCTCAGAAGCA6AGGGGGCCC6GCCCCTGAGGGAGAATGTGCCA 
PheSerGluAlaGluGlyAlaArgProLeuArgGluAsnValPro 

CGGGTCATAATGATCGTGACGGATGGGAGACCTCAGGACTCCGTG 
ArgVallleMetlleValThrAspGlyArgProGlziAspSerVal 

GCCGAGGTGGCTGCTAAGGCACGGGACACGGGCATCCTAATCTTT 
AlaGluValAlaAlaLysAlaArgAspThrGlyl leLeuI lePhe 

GCCATTGGT6TGGGCCAG6TAGACTTCAACACCTTGAA6TCCATT 
Alal leGlyValGlyGlnValAspPheAsnThrLeuLysSer lie 

GG6AGTGAGCCCCAT6AGGACCATGTCTTCCTTGTGGCCAATTTC 
GlySerGluProHisGliiAspHisValPheLeuValAlaAsnPhe 

AGCCAGATTGAGACGCTGACCTCCGTGTTCCAGAAGAAGTTGTGC 
SerGlnlleGluThrLeuThrSerValPheGlnLysliysLeuCys 
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766 ACGGCCCACATGTGCAGCACCCTGGAGCATAACTGTGCCCACTTC 
ThrAlaHisMetCysSerThrLeuGluHisAsnCysAlaHisPhe 

811 TGCATCAACATCCCTGGCTC ATACGTCTGCAGGTGCAAAC AAGGC 
CysIleAsnlleProGlySerTyrValCysArgCysLysGlnGly 

856 TACATTCTCAACTCGGATC AGACGACTTGCAGAATCCAGGATCTG 
TyrlleLeuAsnSerAspGlnThrThrCysArglleGlnAspLeu 

901 T6TGCCATGGAGGACCACAACT6TGAGCAGCTCT6TGTGAATGTG 
CysAlaMetGliiAspHisAsnCysGluGlnLeuCysValAsnVal 

946 CCGGGCTCCTTCGTCTGCGAGTGCTACAGTGGCTACGCCCTGGCT 
ProGlySerPheValCysGluCysTyrSerGlyTyrAlaLeuAla 

991 GAGGAT6GGAA6A66T6TGTGGCT6TGGACTACTGTGCCTCA6AA 
GluAspGlyLysArgCysValAlaValAspTyrCysAlaSerGlu 

1036 AACCAC6GAT6TGAACATGAGTGTGTAAATGCTGATGGCTCCTAC 
AsnHisGlyCysGluHisGluCysValAsnAlaAspGlySerTyr 

1081 CTTTGCCAGTGCCAT6AAGGATTTGCTCTTAACGCAGATGAAAAA 
LeuCy sGlnCy sHi sGluGlyPheAlaLeuAsnProAspGluLys 

1126 ACGTGCACAAAGATAGACTACTGTGCCTCATCTAATCATGGAT6T 
ThrCysThrLysIleAspTyrCysAlaSerSerAsnHisGlyCys 

117 1 CAGTACGAGTGTGTTAACACAGATGATTCCTATTCCTGCCACTGC 
GlnTyrGluCysValAsnThrAspAspSerTyrSerCysHlsCys 

1216 CTGAAAGGCTTTACCCTGAATCCAGATAAGAAAACCTGCAGAAGG 
LeuLysGlyPheThrLeuAsnProAspLysLysThrCysArgArg 

1261 ATCAACTACTGTGCACTGAACAAACCGGGCTGTGAGCATGAGTGC 
1 1 eAsnTyrCy sAlaLeuAsnLy sProGlyCy sGluHi sGluCy s 

1306 GTCAACATGGAGGAGAGCTACTACTGCCGCTGCCACCGTGGCTAC 
ValAsnMetGluGluSerTyrTyrCysArgCysHisArgGlyTyr 

1351 ACTCTGGACCCCAATGGCAAACCCTGCAGCCGAGTGGACCACTGT 
ThrLeuAspProAsnGlylfysProCysSerArgValAspHisCys 

1396 GCACAGCAGGACCATGGCTGTGAGCAGCTGTGTCTGAACACGGAG 
AlaGlnGlnAspHisGlyCysGluGlnLeuCysLeuAsnThrGlu 

144 1 6ATTCCTTCGTCTGCCAGTGCTCAGAAGGCTTCCTCATCAACGAG 
AspSerPheValCysGlnCysSezrGluGlyPheLeuIleAsnGlu 

1486 GACCTCAA6ACCTGCTCCCGGGTGGATTACTGCCTGCTGAGTGAC 
AspLeuLysThrCysSerArgValAspTyrCysLeuLeuSerAsp 
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1531 CATGGTTGT6AATACTCCTGTGTCAACATGGACA6ATCCTTTGCC 
HisGlyCysGluTyrSerCysValAsnMetAspArgSerPheAla 

1576 TGTCAGTGTCCTGAGGGACACGTGCTCCGCAGCGATGGGAA6ACG 
CysGlnCysProGluGlyHisValLeuArgSerAspGlyLysThr 

1621 TGTGCAAAATT6GACTCTTGTGCTCTGGGGGACCACGGTTGTGAA 
Cy sAl aLy sLeiiAspSerCysAlaLeuGlyAspHi sGlyCy sGlu 

1666 CATTCGTGTGTAAGC AGTGAA6ATTCGTTTGTGT6CCAGTGCTTT 
HisSerCysValSerSerGluAspSerPheValCysGlnCysPhe 

17 11 GAAGGTTATATACTCCGTGAAGATGGAAAAACCTGCAGAAGGAAA 
GluGlyTyrlleLeuArgGlxiAspGlyLysThrCysArgArgLys 

17 5 6 GATGTCTGCCAAGCTATA6ACCAT6GCTGTGAACACATTTGTGTG 
AspValCy sGlnAl al leAspHi sGlyCysGluHi s I leCy sVal 

1801 AACAGTGACGACTCATACACGTGCGAGTGCTTGGAGGGATTCCGG 
AsnSerAspAspSerTyrThrCysGluCysLeuGluGlyPheArg 

1846 CTCACTGAGGATGGGAAACGCTGCCGAATTTCCTCAGGGAAGGAT 
LeuThziGluAspGlyLysArgCysArglleSerSerGlyLysAsp 

1891 GTCTGCAAATCAACCCACCATGGCTGCGAACACATTTGTGTTAAT 
ValCy sLy sSerThrHi sHi sGlyCy sGluHi s I leCy sVal Asn 

1936 AATGGGAATTCCTACATCTGCAAATGCTCAGAG6GATTTGTTCTA 
AsnGlyAsnSerTyrlleCysLysCysSerGluGlyPheValLeu 

1981 GCTGAGGACGGAAGAC6GTGCAA6AAATGCACTGAAGGCCCAATT 
AlaGliLAspGlyArgArgCysLysLysCysThrGluGlyProIle 

2026 GACCTGGTCTTTGTGATCGATGGATCCAA6AGTCTTGGAGAAGAG 
AspLeuValPheVallleAspGlySerLysSerLeuGlyGluGlu 

2071 AATTTTGAG6TC6T6AAGCA6TTTGTCACTG6AATTATAGATTCC 
AsnPheGluValValLysGlnPheValThrGlyllelleAspSer 

2116 TTGACAATTTCCCCCAAAGCCGCTCGAGTGGGGCTGCTCCAGTAT 
LeuThrlleSerProLysAlaAlaArgValGlyLeuLeuGlnTyz' 

2161 TCCACACAGGTCCACACAGAGTTCACTCTGAGAAACTTCAACTCA 
SerThrGlnValHisThrGluPheThrLeuArgAsnPheAsnSer 

2206 GCCAAA6ACATGAAAAAA6CCGT6GCCCACAT6AAATACATGG6A 
AlaLysAspMetLysLysAlaValAlaHisMetLysTyrMetGly 
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2251 AAGGGCTCTATGACTGGGCTGGCCCTGAAACACATGTTTGAGAGA 
LysGlySerMetThrGlyLeuAlaLeuLysHisMetPheGlxiArg 

2296 AGTTTTACCCAAGGAGAAGGGGCC AGGCCCTTTTCC ACAAGGGTG 
SerPheThrGlnGlyGluGlyAlaArgProPheSerThrArgVal 

2341 CCCAGAGCAGCCATT6TGTTCACCGACG6ACGGGCTCAG6ATGAC 
ProArgAlaAlalleValPheThrAspGlyArgAlaGlnAspAsp 

2386 GTCTCCGAGTGGGCC AGTAAAGCCAAGGCCAATGGTATCACTATG 
ValSerGluTrpAlaSerLysAlaLysAlaAsnGlylleThrMet 

2431 TAT6CT6TTGGGGTAG6AAAAGCCATTGAG6A6GAACTACAAGAG 
TyrAlaValGlyValGlyLysAlalleGluGluGluLeuGlnGlu 

2476 ATTGCCTCTGAGCCCACAAACAAGCATCTCTTCTATGCCGAAGAC 
IleAlaSerGluProThrAsnLysHisLeuPheTyrAlaGluAsp 

2521 TTCAGCACAATGGATGAGATAAGTGAAAAACTC AAGAAAGGCATC 
PheSerThrMetAspGluIleSerGluLysLeuLysLysGlylle 

2566 TGT6AAGCTCTAGAA6ACTCC6ATGGAAGACAGGACTCTCCAGCA 
CysGluAlaLeuGlxiAspSerAspGlyArgGlnAspSerProAla 

2611 GGGGAACTGCCAAAAACGGTCCAACAGCCAACA6AATCTGA6CCA 
GlyGluLeuProIiysThrValGlnGlnProThrGluSerGluPro 

2656 GTCACC ATAAATATCCAAGACCTACTTTCCTGTTCTAATTTTGCA 
ValThrlleAsnlleGlnAspLeuLeuSerCysSerAsnPheAla 

2701 GTGCAACACAGATATCTGTTTGAAGAAGACAATCTTTTACGGTCT 
ValGlnHisArgTyrLeuPheGluGltiAspAsnLeuLeuArgSer 

2746 ACAC AAAAGCTTTCCC ATTCAACAAAACCTTC AGGAAGCCCTTTG 
ThrGlnLysLeuSerHisSerThrLysProSerGlySerProLeu 

2791 GAAGAAAAACACGATCAATGCAAATGT6AAAACCTTATAATGTTC 
GluGluLysHisAspGlnCysLysCysGluAsnLeuIleMetPhe 

2836 CAGAACCTTGCAAACGAAGAAGTAAGAAAATTAACACAGCGCTTA 
GlnAsnLeuAlaAsnGluGluValArgLysLeuThrGlnArgLeu 

2881 GAAGAAATGACACAGA6AATGGAAGCCCTG6AAAATCGCCTGA6A 
GluGluMetThrGlxiArgMetGluAlaLeuGliiAsnArgLeuArg 

2926 TACA6AT6AAGATTAGAAATCGCGACACATTTGTAGTCATTGTAT 
TyrArg 
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CACGGATTACAATGAACGCAGTGCA6AGCCCCAAAGCTCAGGCTA 
TT6TTAAATCAATAAT6TT6TGAA6TAAAACAATCA6TACTGAGA 
AACCTGGTTTGCCACAGAACAAAGACAAGAAGTATACACTAACTT 
GTATAAATTTATCTA66AAA2UUU^TCCTTCA6AATTCTAAGATGA 
ATTTACCA6GTGA6AAT6AATAAGCTAT6CAA6GTATTTTGTAAT 
ATACTGTGGACACAACTTGCTTCT6CCTCATCCTGCCTTAGTGTG 
CAATCTCATTTGACTATACGATAAAGTTTGCACAGTCTTACTTCT 
GTAGAACACTGGCCATAGGAAATGCTGTTTTTTTGTACTGGACTT 
TACCTTGATATAT6TATATG6ATGTATGCATAAAATCATAGGACA 
TATGTACTTGTGGAACAAGTTGGATTTTTTATACAATATTAAAAT 
TCACCACTTCA6AGAAAA6T2VAAAAAA 



Fig. 5 (continued) 
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1 CGGCCCTTCTCACACTCCTGCCCTGCTGAT6TGGAAC66GGTTTG 
46 GGGTTCTGCAGGGCTATTGTCTGCGCTGGGGAAGGGGACAGGCCG 
9 1 GGACCGGGACCTCCGCTCGCAGCCGGCCGCACCAGCA6GACAGCT 

136 GGCCT6AAGCTCAGAGCCGGGGCGTGCGCCATGGCCCCACACTGG 



181 GCTGTCTGGCTGCTGGCAGCAAGGCTGTGGGGCCTGGGCATTG6G 
AlaValTrpLeuLeuAlaAlaArgLeuTrpGlyLeuGlylleGly 

226 GCTGAGGTGTGGTGGAACCTTGTGCCGCGTAAGACAGTGTCTTCT 
AlaGluValTrpTiTpAsnLeuValProArgLysThrValSerSer 

271 GGGGAGCTGGCCACGGTAGTACG6CGGTTCTCCCAGACCGGCATC 
GlyGluLeuAlaThrValValArgArgPheSerGlnThrGlylle 

316 C AGGACTTCCTGACACT6AC6CTGACGGAGCCCACTGGGCTTCTG 
GlnAspPheLeuThrLeuThrLeuThrGluProThrGlyLeuIieu 

361 TACGTGGGCGCCCGA6AGGCCCTGTTTGCCTTCAGCATGGAGGCC 
TyrValGlyAlaArgGluAlaLeuPheAlaPheSerMetGluAla 

406 CTGGAGCTGCAAGGAGCGATCTCCTGGGAGGCCCCCGTG6AGAAG 
LeuGluLeuGlnGlyAlalleSerTrpGluAlaProValGluLys 

451 AAGACTGAGTGTATCCAGAAAGGGAAGAACAACCAGACCGA6TGC 
LysThrGluCysIleGlnLysGlyLysAsnAsnGlnThrGluCys 

496 TTCAACTTCATCCGCTTCCT6CAGCCCTACAATGCCTCCCACCTG 
PheAsnPhelleArgPheLeuGlnProTyrAsnAlaSerHi sLeu 

541 TACGTCTGTGGCACCTACGCCTTCCAGCCCAAGTGCACCTACGTC 
TyrValCysGlyThrTyrAlaPheGlnProLysCysThrTyrVal 

586 AACATGCTCACCTTCACTTTGGAGCATGGAGAGTTTGAAGATGGG 
AsnMetLeuThrPheThrLeuGluHisGlyGluPheGluAspGly 

631 AAGGGCAAGTGTCCCTATGACCCAGCTAAGGGCCATGCTGGCCTT 
LysGlyLysCysProTyrAspProAlaLysGlyHisAlaGlyLeu 

67 6 CTTGTGGATGGTGAGCTGTACTCGGCCACACTCAACAACTTCCT6 
LeuValAspGlyGluLeuTyrSerAlaThrLeuAsnAsnPheLeu 

721 GGCACGGAACCCATTATCCTGCGTAACATGGGGCCCCACCACTCC 
GlyThrGluProI lei 1 eLeuArgAsnMe tGlyProHi sHi sSer 



MetAlaProHisTrp 




23/35 



766 ATGAAGACAGAGTACCTGGCCTTTTGGCTCAACGAACCTCACTTT 
MetLysThrGluTyrLeuAlaPheTznpLeuAsnGluProHisPhe 

811 GTAGGCTCTGCCTATGTACCTGAGAGGGTGGGCCTGCTGTGGACA 
ValGlySerAlaTyrValProGluArgValGlyLeuLeuTrpThr 

856 ATGGCATACTCTCTTCCAGCCCTAGGAGGAGGGCTCCTAACAGTG 
MetAlaTyrSerLeuProAlaLeuGlyGlyGlyLeuLeuThrVal 

901 TAACTTATTGTGTCCCCGCGTATTTATTTGTTGTAAATATTTGAG 
946 TATTTTTATATTGACAAATAAA 



Fig. 6 (continued) 
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L GGCACCA66CCTTCCG6A6A6AC6CA6TCGGCT6CCACCCC6GGA 

M 

4 6 TGGGTCGCTGGTGCCAGACCGTCGCGCGCGGGCAGCGCCCCCGGA 
etGlyArgTjqpCysGlnThrValAlaArgGlyGlnArgProArgT 

9 1 CGTCTGCCCCCTCCCGCGCCGGTGCCCTGCTGCTGCTGCTTCTGT 
hrSerAlaProSerArgAlaGlyAlaLeuLeuLeuLeuLeuLeuL 

136 TGCT6A66TCTGCAGGTTGCTGGGGCGCAGGGGAAGCCCCGGGGG 
euLeuArgSerAlaGlyCysTrpGlyAlaGlyGluAlaProGlyA 

181 CGCTGTCCACTGCTGATCCCGCCGACCAGAGCGTCCAGTGTGTCC 
laLeuSerThrAlaAspProAlaAspGlnSerValGlnCysValP 

226 CCAAGGCCACCTGTCCTTCCAGCCGGCCTCGCCTTCTCTGGCAGA 
roLysAlaThrCysProSerSerArgProArgLeuLeuTrpGlnT 

271 CCCCGACCACCCAGACACTGCCCTCGACC ACC ATGGAGACCCAAT 
hrProThrThrGlnThrLeuProSerThrThrMetGluThrGlnP 

3 16 TCCCAGTTTCTGAAGGCAAAGTCGACCCATACCGCTCCTGTGGCT 
heProValSerGluGlyLysValAspProTyrArgSerCysGlyP 

361 TTTCCTACGA6CAGGACCCCACCCTCAG6GACCCAGAAGCCGTGG 
heSerTyrGluGlnAspProThrLeuArgAspProGluAlaValA 

406 CTCGGCGGTGGCCCTGGATGGTCAGCGTGCGGGCCAATGGCACAC 
laArgArgTrpProTrpMetValSerValArgAlaAsnGlyThrH 

451 ACATCTGTGCCGGCACCATCATTGCCTCCCAGTGGGTGCTGACTG 
isIleCysAlaGlyThrllelleAlaSerGlnTrpValLeuThrV 

496 TGGCCCACTGCCTGATCTGGCGTGATGTTATCTACTCAGTGAGGG 
alAlaHi sCy sLeuI 1 eTrpArgAspVal 1 1 eTy r SerVal ArgV 

541 TGGGGAGTCCGTGGATTGACCAGATGACGCAGACCGCCTCCGATG 
alGlySerProTrplleAspGlnMetThrGlnThrAlaSerAspV 

586 TCCCGGTGCTCCAGGTCATCATGCATAGCAGGTACCGGGCCCAGC 
alProValLeuGlnVal 1 1 eMetHi sSer ArgTyr ArgAlaGlxiA 

631 GGTTCT66TCCTGGGTG6GCCAGGCCAACGACATCGGCCTCCTCA 
rgPheTrpSerTrpValGlyGlnAl aAsnAspI 1 eGlyLeuLeuL 

676 AGCTCAAGCAGGAACTCAAGTACAGCAATTACGTGCGGCCCATCT 
ysLeuLysGlnGluLeuLysTyrSerAsxiTyrValArgProIleC 



Fig. 7 
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721 GCCTGCCTGGCACGGACTATGTGTTGAAGGACCATTCCCGCTGCA 
ysLeuProGlyThrAspTyrValLeuLysAspHlsSerArgCysT 

766 CT6TGACG6GCTGGGGACTTTCCAAGGCT6ACGGCATGTGGCCTC 
hrValThrGlyTrpGlyLeuSerLysAlcLAspGlyMetTrpProG 

811 A6TTCC6GACCATTC AGGAGAA6GAAGTCATCATCCTGAACAACA 
InPheArgThrlleGlnGluLysGluValllelleLeuAsnAsnL 

856 AAGAGTGT6ACAATTTCTACCACAACTTCACCAAAATCCCCACTC 
ysGluCysAspAsnPheTyrHisAsnPheThrLysIleProThrL 

901 TGGTTCAGATCATCAAGTCCCAGATGAT6TGTGCG6AGGACACCC 
euValGlnllelleliysSerGlnMetMetCysAlaGluAspThrH 

946 ACA6GGA6AA6TTCTGCTAT6AGCTAACTG6AGAGCCCTTGGTCT 
IsArgGluLysPheCysTyrGluLeuThrGlyGluProLeuValC 

991 GCTCCAT66AG6GCACGTGGTACCTGGTGGGATTGGTGA6CTGG6 
ysSerHetGluGlyThrTrpTyrLeuValGlyLeuValSerTrpG 

1036 GTGCAGGCT6CCAGAAGAGC6A6GCCCCACCCATCTACCTACAG6 
lyAlaGlyCysGlnLysSerGluAlaProProIleTyrLeuGlnV 

1081 TCTCCTCCTACCAACACTGGATCTGG6ACTGCCTCAACGGGCAGG 
al Ser SerTyirGlnHi sTrpIl eTrpAspCysLeuAsnGlyGlnA 

1126 CCCTGGCCCTGCCAGCCCCATCCAGGACCCT6CTCCTGGCACTCC 
laLeuAlaLeuProAlaProSerArgThrLeuLeuLeuAlaLeuP 

1171 CACTGCCCCTCAGCCTCCTT6CTGCCCTCTGACTCTGTGTGCCCT 
roLeuProLeuSerLeuLeuAlaAlaLeu 

1216 CCCTCACTTGTGGGCCCCCCTTGCCTCCGT6CCCAGGTTGCTGTG 

1261 GGTGCAGCTGTCACAGCCCTGA6AGTCAGGGTGGA6ATGAGGTGC 

1306 TCAATTAAACATTACTGTTTTCCATGTAAAAAAAAAAAAAAAAAA 

1351 AAAAAAAAA 



Fig. 7 (continued) 




AGTGATGGTMGTGCTGGCCCA6ACTGAAGCTCGGAGAGGCACTCTGCTTGCCCAGCGTCACA6TC 
161 

TCCTGGCTTCCAGTCTCCCTTGCTTCCa^GATCOykG&CPCTAGrc 
241 



TAC6CAATCTGCGCCTGCGTCTCATCAGTCGCCCCACATGTAACT6TATCTACA2^CA6CTG^ 
321 

AACCXGGCCCGGCCTGGGATGCTATGTGGGGGCCCCaGCCTGGGGTGCAGGGCCCC^^ 
401 

GAAGGAGCAGAAGGGGAGGGGCCTAACCCTGGGCTGGGGGTTGGACTCACAGGACT^ 
481 

GTGTCTGCCATAGCTGGGCTCAGGCATCTGTCCTTGGCTmTTGCCTGGCTCCAGGGAG^^ 
561 

TGOrrcGAGCCTGACGGACACTGGGTTCAGGCTGGCATaiTaGCTO^ 
641 

GCTGCmCCAAO^CAGCTGCTaCAGTTCCTGGCTGCAGGCTCGAGTTCAGGG® 
721 

AGACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGATCCTTGAGGACAGCAGGTC^ 
HetSerAspGluAspSeiCysValAlaCysGlySerLeuArgThrAlaGlyProGlnAlaGlyAlaPro 
801 

TCCCCATGGCCCTGGGAGGCCAGGCTGATGOICCAGGGACAGCTGGCCTGTGGCGGAGCCCTGGTC 
SerProTinpProTipGluAlaArgLeuMetHisGlnGlyGlnl^QAlaCysGlyGlyAlaLettValSerGluGluA^ 
881 

GCTAACT6CTG(X:CACTGCTTCATT6GGCGCCA6GCCCCA6AGGAATGGAGCGTAGGGCT^^ 
lLeu11irAlaAlaHisCysPheIle61yArgGlnAlaProGlu61uTipSerVal61yI.eu61yTfarArgPro61u6luT 
961 

GGG6CCT6AAGCAGCTCATCCTGCATG6AGCCTACA(X:CACCCT6A66GGG6CTACGACATGG^ 
ipGlyLeuLysGlnLeuIleLeuHisGlyAlaTyrlltoBisProGluGlyGlyTyrAsiflletJdaL^ 
1041 

CAGCCT6TGACACTGG6AGCCAGCCTGCGGCCCCTCTGCCTGCCCTATGCTGACCACCACCTGC 
GlnProVallbrLeuGlyAlaSerLeiiArgProLeuCysLeaProTyrAlaAspHisHisLettProAspGlyGlt^ 
1121 

CTGGGTTCTGGGACGGGCaGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGT^ 
yTipValLeuGlyArgAlaArgProGlyAlaGlylleSerSerLeuGlnThrValProValllirLeuLeuGlyProArgA 
1201 

CCTGCAGCCm:TGCATGCAGCTCCTGGGGGTGATGGCA6CCCTATTCT^ 

laCysSerArgLeuHisAlaAlaProGlyGlyAspGlySerProIleLeuProGlyHemlCysThrSerAlaValGly 

1281 

GAGCTGCCCAGCTGTGAGGTGAGCCCCAGGCCCCCAOlCCmCCTAACAGGCCCCTGGCATCCCCT^ 
GliiLeuProSerCysGluValSerProArgProProHisLeuThr 
1361 

AA6AACGGACCTT(X:AGGCTTGGCCTCTGGACCCACCTCCCACCT6AAGCTA^ 
1441 

GCCA6 

Fig. 8 
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1 CTTAACAGCCACTT6TTTCATCCCACCTGGGCATTAGGTTGACTT 

46 CAAA6ATGCCTCA6TTACTGCAAAACATTAATGG6ATCATC6AG6 
Met ProGlnLeuLeuGlnAsnl leAsnGlyl lei leGluA 



9 1 CCTTCAGGCGCTAT6CAAGGACGGAGGGCAACTGCACAGCGCTCA 
laPheArgArgTyrAlaArgThrGluGlyAsnCysThrAlaLeuT 

13 6 CCCGAGGGGAGCT6AAAAGACTCTTGGAGCAAGAGTTTGCCGATG 
hrArgGlyGluLeuLysArgLeuLeuGluGlnGluPheAlaAspV 

181 T6ATTGTGAAACCCCACGATCCA6CAACTGTGGATGAG6TCCTGC 
allleValLysProHisAspProAlaThrValAspGluValLeuA 

226 GTCTGCTGGATGAAGACCACACAGGGACTGTG6AATTCAAGGAAT 
rgLeuLeuAspGluAspHisThrGlyThrValGluPheLysGluP 

271 TCCTG6TCTTAGTGTTTAAAGTT6CCCAGGCCTGTTTCAAGACAC 
heLeuValLeuValPheLysValAlaGlnAlaCysPheLysThrL 

316 T6AGCGAGAGTGCTGAGGGAGCCTGCGGCTCTCAAGAGTCTGGAA 
euSerGluSerAlaGluGlyAlaCysGlySerGlnGluSerGlyS 

361 GCCTCCACTCTGGGGCCTCGCA66AGCTGGGCGAAGGACAGAGAA 
erLeuHisSerGlyAlaSerGlnGluLeuGlyGluGlyGlnArgS 

406 GTGGCACTGAAGTGGGAAGGGCGGGGAAAGGGCAGCATTATGAGG 
erGlyThrGluValGlyArgAlaGlyLysGlyGlnHisTyrGluG 

451 G6AGCAGCCACAGACAGAGCCAGCAGGGTTCCA6AGGGCAGAACA 
lySerSerHisArgGlnSerGlnGlnGlySerArgGlyGlnAsnA 

496 GGCCTGGGGTTCAGACCCAGGGTCAGGCCACTGGCTCTGCGTGGG 
rgProGlyValGlnThrGlnGlyGlnAlaThrGlySerAlaTrpV 

541 TCAGCAGCTATGACAGGCAAGCTGAGTCCCAGAGCCAGGAAAGAA 
alSerSerTyrAspArgGlnAlaGluSerGlnSerGlnGluArgl 

586 TAAGCCCGCAGATACAACTCTCTGG6CAGACAGAGCA6ACCCAGA 
leSerProGlnlleGlnLeuSerGlyGlnThrGluGlnThrGlnL 

631 AAGCTGGA6AAGGCAAGAGGAATCA6ACAACAGAGATGAGGCCAG 
ysAlaGlyGluGlyLysArgAsnGlnThrThrGluMetArgProG 

67 6 AGAGACAGCCACA6ACCAGGGAACAGGACAGA6CCCACCAGACAG 
luArgGlnProGlnThrArgGluGlnAspArgAlaHisGlnThrG 



Fig. 9 
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721 GTGAGACTGT6ACTGGATCTGGAACTCAGACCCAGGCAGGTGCCA 
lyGluThrValThrGlySerGlyThrGlnThzrGlnAlaGlyAlaT 

766 CCC AGACTGTGGAGC AGGAC AGC AGCC ACCAGACAGGAAGC ACCA 
hrGlnThrValGluGlnAspSerSerHisGlnThrGlySerThrS 

811 GCACCCAGACACAGGAGTCCACCAATGGCCAGAACAGAGGGACTG 
erThrGlxiThrGlnGluSerThrAsnGlyGlnAsnArgGlyThrG 

856 AGATCCACGGTCAAGGCAG6AGCCAGACCAGCCAGGCTGTGACAG 
luIleHisGlyGlnGlyArgSerGlnThrSerGlnAlaValThrG 

901 GA66ACACACTCAGATACAGGCAGGGTCACACACC6AGACTGTGG 
lyGlyHisThrGlnlleGlnAlaGlySerHisThrGluThrValG 

946 AGCAGGACAGAAGCCAAACTGTAA6CCACGGAG6GGCTAGAGAAC 
luGlnAspArgSerGlnThrVal SerHi sGlyGlyAlaArgGluG 

991 AGGGACAGACCCAGACGCAGCCAGGCAGT6GTCAAAGATGGATGC 
InGlyGlnThrGlnThrGlnProGlySerGlyGlnArgTrpMetG 

1036 AAGTGAGCAACCCTGAGGCAGGA6AGACAGTACCGGGAGGACAGG 
InValSerAsnProGluAlaGlyGluThrValProGlyGlyGlnA 

1081 CCCAGACTGGG6CAAGCACTGA6TCAGGAAGGCAGGA6TGGAGCA 
laGlnThrGlyAlaSerThrGluSerGlyArgGlnGluTrpSerS 

1126 GCACTCACCCAAGGCGCTGTGTGACAGAAGGGCAGGGAGACAGAC 
erThrHisProArgArgCysValThrGluGlyGlnGlyAspArgG 

1171 AGCCCACAGT6GTTGGTGAGGAATGGGTTGATGACCACTCAAGG6 
InProThrValValGlyGluGluTrpValAspAspHisSerArgG 

1216 AGACAGTGATCCTCAGGCTGGACCAGGGCAACTTGCATACCAGTG 
luThrVal 1 1 eLeuArgLeuAspGlnGlyAsnLexiHi sThr SerV 

1261 TTTCCTCAGCACAGGGCCAGGAT6CAGCCCAGTCAGAAGAGAAGC 
alSerSerAlaGlnGlyGlnAspAlaAlaGlnSerGluGlxiLysA 

1306 GAG6CATCACAGCTAGAGAGCTGTATTCCTACTT6AGAAGCACCA 
rgGlylleThrAleLArgGluLeuTyrSexrTyrLeuArgSerThrL 

1351 AGCCATGACTTCCCCGACTCCAATGTCCAGTACTGGAAGAAGACA 
ysPro 

1396 GCTGGAGAGAGTTTGGCTTGTCCTGCATGGCCAATCCAGTGGGTG 

1441 CATCCCTGGACATCAGCTCTTCATTATGCAGCTTCCCTTTTAG6T 

I486 CTTTCTCAATGAGATAATTTCTGCAAGGAGCTTTCTATCCTGAAC 

1531 TCTTCTTTCTTACCTGCTTTGCG6TGCAGACCCTCTCAGGAGCA6 

1576 GAAGACTCAGAACAAGTCACCCCTT 



Fig. 9 (continued) 
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Figure 15. Nucleotide Sequence for CGI 063 18-01. 



>CG106318-01 4810 nt 

GTCCATGGGGCCGATGTATGGGAGATGAATGTGGTCCCGGAGGCATCCAAACGAGGGCTG 

TGTGGTGTGCTCATGTGGAGGGATGGACTACACTGCATACTAACTGTAAGCAGGCCGAGA 

GACCCAATAACCAGCAGAATTGTTTCAAAGTTTGCGATTGGCACAAAGAGTTGTACGACT 

GGAGACTGGGACCTTGGAATCAGTGTCAGCCCGTGATTTCAAAAAGCCTAGAGAAACCTC 

TTGAGTGCATTAAGGGGGAAGAAGGTATTCAGGTGAGGGAGATAGCGTGCATCCAGAAAG 

ACAAAGACATTCCTGCGGAGGATATCATCTGTGAGTACTTTGAGCCC AAGC CTCTCCTGG 

AGCAGGCTTGCCTCATTCCTTGCCAGCAAGATTGCATCGTGTCTGAATTTTCTGCCTGGT 

CCGAATGCTCCAAGACCTGCGGCAGCGGGCTCCAGCACCGGACGCGTCATGTGGTGGCGC 

CCCCGCAGTTCGGAGGCTCTGGCTGTCCAAACCTGACGGAGTTCCAGGTGTGCCAATCCA 

GTCCATGCGAGGCCGAGGAGCTCAGGTACAGCCTGCATGTGGGGCCCTGGAGCACCTGCT 

CAATGCCCCACTCCCGACAAGTAAGACAAGCAAGGAGACGCGGGAAGAATAAAGAACGGG 

AAAAGGACCGCAGCAAAGGAGTAAAGGATCCAGAAGCCCGCGAGCTTATTAAGAAAAAGA 

GAAACAGAAACAGGCAGAACAGACAAGAGAACAAATATTGGGACATCCAGATTGGATATC 

AGACCAGAGAGGTTATGTGCATTAACAAGACGGGGAAAGCTGCTGAmAAGCTTTTGCC 

AGCAAGAGAAGCTTCCAATGACCTTCCAGTCCTGTGTGATCACCAAAGAGTGCCAGGTTT 

CCGAGTGGTCAGAGTGGAGCCCCTGCTCAAAAACATGCCATGACATGGTGTCCCCTGCAG 

GCACTCGTGTAAGGACACGAACCATCAGGCAGTTTCCCATTGGCAGTGAAAAGGAGTGTC 

CAGAATTTGAAGAAAAAGAACCCTGTTTGTCTCAAGGAGATGGAGTTGTCCCCTGTGCCA 

CGTATGGCTGGAGAACTACAGAGTGGACTGAGTGCCGTGTGGACCCTTTGCTCAGTCAGG 

AGGACAAGAGGCGCGGCAACCAGACGGCCCTCTGTGGAGGGGGCATCCAGACCCGAGAGG 

TGTACTGCGTGCAGGCCAACGAAAACCTCCTCTCACAATTAAGTACCCACAAGAACAAAG 

AAGCCTCAAAGCCAATGGACTTAAAATTATGCACTGGACCTATCCCTAATACTACACAGC 

TGTGCCACATTCCTTGTCCAACTGAATGTGAAGTTTCACCTTGGTCAGCTTGGGGACCTT 

GTACTTATGAAAACTGTAATGATCAGCAAGGGAAAAAAGGCTTCAAACTGAGGAAGCGGC 

GCATTACCAATGAGCCCACTGGAGGCTCTGGGGTAACCGGAAACTGCCCTCACTTACTGG 

AAGCCATTCCCTGTGAAGAGCCTGCCTGTTATGACTGGAAAGCGGTGAGACTGGGAGACT 

GCGAGCCAGATAACGGAAAGGAGTGTGGTGCAGGCACGCAAGTTCAAGAGGTTGTGTGCA 

TCAACAGTGATGGAGAAGAAGTTGACAGACAGCTGTGCAGAGATGCCATCTTCCCCATCC 

CTGTGGCCTGTGATGCCCCATGCCCGAAAGACTGTGTGCTCAGCACATGGTCTACGTGGT 

CCTCCTGCTCACACACCTGCTCAGGGAAAACGACAGAAGGGAAACAGATACGAGCACGAT 

CCATTCTGGCCTATGCGGGTGAAGAAGGTGGAATTCGCTGTCCAAATAGCAGTGCTTTGC 

AAGAAGTACGAAGCTGTAATGAGCATCCTTGCACAGTGTACCACTGGCAAACTGGTCCCT 

GGGGCCAGTGCATTGAGGACACCTCAGTATCGTCCTTCAACACAACTACGACTTGGAATG 

GGGAGGCCTCCTGCTCTGTCGGCATGCAGACAAGAAAAGTCATCTGTGTGCGAGTCAATG 

TGGGCCAAGTGGGACCCAAAAAATGTCCTGAAAGCCTTCGACCTGAAACTGTAAGGCGTT 

GTCTGCTTCCTTGTAAGAAGGACTGTATrGTGACCCCATATAGTGACTGGACATCATGCC 

CCTCTTCGTGTAAAGAAGGGGACTCCAGTATCAGGAAGCAGTCTAGGCATCGGGTCATCA 

TTCAGCTGCCAGCCAACGGGGGCCGAGACTGCACAGATCCCCTCTAfGAAGAGAAGGCCT 

GTGAGGCACCTCAAGCGTGCCAAAGCTACAGGTGGAAGACTCACAAATGGCGCAGATGCC 

AATTAGTCCCrrGGAGCGTGCAACAAGACAGCCCTGGAGCACAGGAAGGCTGTGGGCCTG 

GGCGACAGGCAAGAGCCATTACTTGTCGCAAGCAAGATGGAGGACAGGCTGGAATCCATG 

AGTGCCTACAGTATGCAGGCCCTGTGCCAGCCCTTACCCAGGCCTGCCAGATCCCCTGCC 

AGGATGACTGTCAATTGACCAGCTGGTCCAAGTnTCTTCATGCAATGGAGACTGTGGTG 

CAGTTAGGACCAGAAAGCGCACTCTTGTTGGAAAAAGTAAAAAGAAGGAAAAATGTAAAA 

ATTCCCATTTGTATCCCCTGATTGAGACTCAGTATTGTCCTTGTGACAAATATAATGCAC 

AACCTGTGGGGAACTGGTCAGACTGTATTTTACCAGAGGGAAAAGTGGAAGTGTTGCTGG 

GAATGAAAGTACAAGGAGACATCAAGGAATGCGGACAAGGATATCGTTACCAAGCAATGG 

CATGCTACGATCAAAATGGCAGGCTTGTGGAAACATCTAGATGTAACAGCCATGGTTACA 

TTGAGGAGGCCTGCATCATCCCCTGCCCCTCAGACTGCAAGCTCAGTGAGTGGTCCAACT 

GGTCGCGCTGCAGCAAGTCCTGTGGGAGTGGTGTGAAGGTTCGTTCTAAATGGCTGCGTG 

AAAAACCATATAATGGAGGAAGGCCTTGCCCCAAACTGGACCATGTCAACCAGGCACAGG 

TGTATGAGGTTGTCCCATGCCACAGTGACTGCAACCAGTACCTATGGGTCACAGAGCCCT 

GGAGCATCTGCAAGGTGACCTTTGTGAATATGCGGGAGAACTGTGGAGAGGGCGTGCAAA 

CCCGAAAAGTGAGATGCATGCAGAATACAGCAGATGGCCGTTCTGAACATGTAGAGGATT 

ACCTCTGTGACCCAGAAGAGATGCCCCTGGGCTCTAGAGTGTGCAAATTACCATGCCCTG 

AGGACTGTGTGATATCTGAATGGGGTCCATGGACCCAATGTGTTTTGCCTTGCAATCAAA 

GCAGTTTCCGGCAAAGGTCAGCTGATCCCATCAGACAACCAGCTGATGAAGGAAGATCTT 

GCCCTAATGCTGTTGAGAAAGAACCCTGTAACCTGAACAAAAACTGCTACCACTATGATT 

ATAATGTAACAGACTGGAGTACATGTCAGCTGAGTGAGAAGGCAGTTTGTGGAAATGGAA 

TAAAAACAAGGATGTTGGATTGTGTTCGAAGTGATGGCAAGTCAGTTGACCTGAAATATT 



GTGAAGCGCTTGGCTTGGAGAAGAACTGGCAGATGAACACGTCCTGCATGGTGGAATGCC 

CTGTGAACTGTCAGCTTTCTGATTGGTCTCCTTGGTCAGAATGTTCTCAAACATGTGGCC 

TCACAGGAAAAATGATCCGAAGACGAACAGTGACCCAGCCCTTTCAAGGTGATGGAAGAC 

CATGCCCTTCCCTGATG6ACCAGTCCAAACCCTGCCCAGTGAAGCCTTGTTATCGGTGGC 

AATATGGCCAGTGGTCTCCATGCCAAGTGCAGGAGGCCCAGTGTGGAGAAGGGACCAGAA 

CAAGGAACATTTCTTGTGTAGTAAGTGATGGGTCAGCTGATGATTTCAGCAAAGTGGTGG 

ATGAGGAATTCTGTGCTGACATTGAACTCATTATAGATGGTAATAAAAATATGGTTCTGG 

AGGAATCCTGCAGCCAGCCTTGCCCAGGTGACTGTTATTTGAAGGACTGGTCTTCCTGGA 

GCCTGTGTCAGCTGACCTGTGTGAATGGTGAGGATCTAGGCTTTGGTGGAATACAGGTCA 

GATCCAGACCGGTGATTATACAAGAACTAGAGAATCAGCATCTGTGCCCAGAGCAGATGT 

TAGAAACAAAATCATGTTATGATGGACAGTGCTATGAATATAAATGGATGGCCAGTGCTT 

GGAAGGGCTCTTCCCGAACAGTGTGGTGTCAAAGGTCAGATGGTATAAATGTAACAGGGG 

GCTGCTTGGTGATGAGCCAGCCTGATGCCGACAGGTCTTGTAACCCACCGTGTAGTCAAC 

CCCACTCGTACTGTAGCGAGACAAAAACATGCCATTGTGAAGAAGGGTACACTGAAGTCA 

TGTCTTCTAACAGCACCCTTGAGCAATGCACACTTATCCCCGTGGTGGTATTACCCACCA 

TGGAGGACAAAAGAGGAGATGTGAAAACCAGTCGGGCTGTACATCCAACCCAACCCTCCA 

GTAACCCAGCAGGACGGGGAAGGACCTGGTTTCTACAGCCATTTGGGCCAGATGGGAGAC 

TAAAGACCTGGGTTTACGGTGTAGCAGCTGGGGCTJ^TrTGTGTTACTCATCTTTATTGTCT 

CCATGATTTATCTAGCTTGCAAAAAGCCAAAGAAACCCCAAAGAAGGCAAAACAACCGAC 

TGAAACCmAACCnAGCCTATGATGGAGATGCCGACATGTAACATATAACTTTTCCTG 

GCAACAACCA (SEQ ID NO: 40) 



Protein Sequence for CGI 0631 8-01 ORF Start: ISORFStop: 4782 Frame: 3 



Protein Sequence: 

>CG1 0631 8-01 -prot 1588 aa 

MGDECGPGGIQTRAVWCAHVEGWTTLHTNCKQAERPNNQQNCFKVCDWHKELYDWRLGPW 

NQCQPVISKSLEKPLECIKGEEGIQVREIACIQKDKDIPAEDIICEYFEPKPLLEQACLI 

PCQQDCIVSEFSAWSECSKTCGSGLQHRTRHWAPPQFGGSGCPNLTEFQVCQSSPCEAE 

ELRYSLHVGPWSTCSMPHSRQVRQARRRGKNKEREKDRSKGVKDPEARELIKKKRNRNRQ 

NRQENKYWDIQIGYQTREVMCINKTGKAADLSFCQQEKLPMTFQSCVITKECQVSEWSEW 

SPCSKTCHDMVSPAGTRVRTRTIRQFPIGSEKECPEFEEKEPCLSQGDGNA^CATYGWRT 

TEWTECRVDPLLSQQDKRRGNQTALCGGGIQTREVYCVQANENLLSQLSTHKNKEASKPM 

DLKLCTGPIPNTTQLCHIPCPTECEVSPWSAWGPCTYENCNDQQGKKGFKLRKRRITNEP 

TGGSGVTGNCPHLLEAIPCEEPACYDWKAVRLGDCEPDNGKECGPGTQVQEWCINSDGE 

EVDRQLCRDAIFPIPVACDAPCPKDCVLSTWSTWSSCSHTCSGKTTEGKQIRARSILAYA 

GEEGGlRCPNSSALQEVRSCNEHPCTWHWQTGPWGQCIEDTSVSSFhfTTTTVVNGEASCS 

VGMQTRKVICVRVNVGQVGPKKCPESLRPETVRPCLLPCKKDCIVTPYSDWTSCPSSCKE 

GDSSIRKQSRHRVIIQLPANGGRDCTDPLYEEKACEAPQACQSYRWKTHKWRRGQLVPWS 

VQQDSPGAQEGCGPGRQARAITCRKQDGGQAGIHECLQYAGPVPALTQACQIPCQDDCQL 

TSWSKFSSCNGDCGAVRTRKRTLVGKSKKKEKCKNSHLYPLIETQYCPCDKYNAQPVGNW 

SDCILPEGKVEVLLGMKVQGDIKECGQGYRYQAMACYDQNGRLVETSRCNSHGYIEEACI 

IF^PSDCKLSEWSNWSRCSKSCGSGVKVRSKWLREKPYNGGRPCPKLDIWNQAQ\nrE\A/P 

GHSDCNQYLWNnHPWSICKVTFVNMRENCGEGVQTRKVRCMQNTADGPSEHVEDYLCDPE 

EMPLGSRVCKLPCPEDCVISEWGPWTQCVLPCNQSSFRQRSADPIRQPADEGRSCPNAVE 

KEPCNLNKNCYHYDYTWTDWSTCQLSEKAVCGNGIKTRMLDCVRSDGKSVDLKYCEALGL 

EKNWQMhrrSCMVECPVNCQLSDWSPWSECSQTCGLTGKMIRRRTVTQPFQGDGRPCPSLM 

DQSKPCPNW>CYRWQYGQWSPCQVQEAQCGEGTRTRNISGWSDGSADDFSKVVDEEFC^ 

DIELIIDGNKNMVLEESCSQPCPGEXJYLKDWSSWSLCQLTGVNGEDLGFGGIQVRSRPVI 

IQELENQHLCPEQMLETKSCYDGCXJYEYKWMASAWKGSSRTVWCQRSDGINVTG 

QPDADRSCNPPCSQPHSYCSETKTCHCEEGYTE\^SSNSTLEQCTLIP\AAAJ*TMEDKRG 

DN^SRAVHPTQPSSNPAGRGRTWFLQPFGPDGRLKTWWGVAAGAFVLLIFIVSMIYLA 

CKKPKKPQRRQNNRLKPLTLAYDGDADM (SEQ ID NO: 41) 



Figure 16. Nucleotide and Protein Sequences for CG50817-04. 



>CG50817-04 1447 nt 

GCGGACACCAGTGATGCTCCTGGGACCCTACGCAATCTGCGCCTGCGTCTCATCAGTCGC 

CCCACATGTAACTGTATCTACAACCAGCTGCACCAGCGACACCTGTCCAACCCGGCCCGG 

CCTGG6ATGCTATGTGGGGGCCCCCAGCCTGGGGTGCAGGGCCCCTGTCAGGTCTGATAG 

GGAGAA6AGAAGGAGCAGAAGGGGAGGG6CCTAACCCTGGGCTGGGGGTTGGACTCACAG 

GACTGGGGGAAAGAGCTGCAATCAGAGGGTGTCTGCCATAGCTGGGCTCAGGCATCTGTC 

CTTGGCTTTGTTGCCTGGCTCCAGGGAGATTCCG6GGGCCCTGTGCTGTGCCTCGAGCCT 

GACGGACACTGGGTTCAGGGTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGAC 

GCTCCTGTGCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAG6CTC6AGTTCAG 

GGGGCAGCTTTCCTGGCCCAGAGCCCAGA6ACCCCGGAGATGAGTGATGAG6ACAGCTGT 

GTAGCCTGTGGATCCTTGAGGACAGCAGGTCCCCAGGCAGGAGCACCCTCCCCATGGCCC 

TGGGAGGCCAGGCTGATGCACCAGGGACAGCTGGCGTGTGGCGGAGCCCTGGTGTCAGAG 

GAGGCGGTGCTAACTGCTGCCCACTGCTTCATTGGGCGCCAGGCCCCA6AGGAATG6AGC 

GTAGGGCTGGGGACCAGACCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCC 

TACACCCACCCTGAGGGGGGCTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACA 

CTGGGAGCCAGCCTGCGGCCCCTCTGCCTGCCCTATGCTGACCACCACCTGCCTGATGGG 

GAGCGTGGCTGGGTTCTGGGACGGGCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACA 

GTGCCCGTGACCCTCCTGGGGCCTAGGGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGT 

GATGGCAGCCCTATTCTGCCGGGGATGGTGTGTACCAGTGCTGTGGGTGAGCTGCCCAGC 

TGTGAGGCCAACCAACCAGCTGCTGACAGGGGACCTGGCCATTCTCAGGAACAA6AGAAT 

GCAGGCAGGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTGATTCCAG 

GCACCAGGGCAGGCCCAGAAGCCCAGCAGCT6TGGGAAGGAACCTGCCTGGGGCCACAGG 

TGCCCACTCCCCACCCTGCAGGACA6GGGTGTCTGTGGACACTCCCACACCCAACTCTGC 

TACCAAGCAGGCGTCTCAGCTTTCCTCCTCCTTTACCCTTTCAGATACAATCACGCCAGC 

CACGTT6TTTTGAAAATTTCI I M I I IGGGGGGCAGCAGTTTTCCI 1 1 1 1 1 lAAACTTAA 

ATAAATT (SEQ ID NO:42) 

Protein Sequence for CG5081 7-04 ORF Start: 520 ORF Stop: 1192 Frame: 1 

Protein Sequence: 
>CG50817-04-prot 224 aa 

MSDEDSCVACGSLRTAGPQAGAPSPWPWEARLMHQGCaACGGALVSEEAVLTAAHCFIGR 
QAPEEWSVGLGTRPEEWGLKQLILHGAYTHPEGGYDMALLLLAQPVTLGASLRPLCLPYA 
DHHLPDGERGWVLGRARPGAGISSLQTVPVTULGPRACSRLHAAPGGDGSPILPGMVCTS 
AVGELPSCEANQPAADRGPGHSQEQENAGRQMALLPLSSPPCHV (SEQ ID NO:43) 



Figure 17. Nucleotide and Protein Sequences for CG50817-05. 



• Nucleotide sequence encoding the Peptidase-like protein of the invention. 

>CG50817-05 

CGCTGGGCCTCTGTCCTG ATGCTGCTGAGCTCCCTGGTGTCTCTCGCTGGTTCTGTCTAC 60 
CTGGCCTGGATCCTGTTCTTCGTGCTCTATGATTTCTGCATTGTTTGTATCACCACCTAT 1 20 
GCTATCAACGTGAGCCTGATGTGGCTCAGTTTCCGGAAGGTCCAAGAACCCCAGGGCCAA 1 80 
CCCAAGCCTCAGGAGGGCAACACAGTCCCTGGCGAGTGGCCCTGGCAGGGCAGTGTGAGG 240 
AGGCAAGGAGCCCACATCTGCAGCGGCTCCCTGGTGGCAGACACCTGGGTCCTCACTGCT 300 
GCCCACTGCTTTGAAAAGGCAGCAGCAACAGAACTGAATTCCTGCGTGAGGGACTCAGCC 360 
CCTGGGGGCGAAGAGGTGGGGGTGGGTGCCCTGCAGTTGCCCAGGGCCTATAACCACTAC 420 
AGCCAGGGCTCAGACCTGGCCCTGCTGCAGCTCGCCCACCCCACGAGCCACACACCCCTC 480 
TGCCTGCCCCAGCCCGCCCATCGCTTCCCCTTTGGAGCGTCCTGCTGGGCCACTGGCTGG 540 
GATCAGGACACCAGTGATGCTCCTGGGACCCTACGCAATCTGCGCCTGCGTCTCATCAGT 600 
CGCCCCACATGTAACTGTATCTACAACCAGCTGCACCAGCGACACCTGTCCAACCCGGCC 660 
CGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTGGGGTGCAGGGCCCCTGTCAGGGAGAT 720 
TCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGACACTGGGTTCAGGCTGGCATCATC 780 
AGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTGTGCTGCTGACCAACACAGCTGCT 840 
CACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTTTCCTGGCCCAGAGCCCAGAG 900 
ACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGATCCTTGAGGACAGCAGGT 960 
CCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGGCCAGGCTGATGCACCAGGGACAG 1020 
CTGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTGCTGCCCACTGCTTC 1 080 
ATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGACCGGAGGAGTGG 1 140 
GGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCCACCCTGAGGGGGGCTACGACATG 1 200 
GCGCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAGCCAGCCTGCGGCCCCTCTGCCTG 1260 
CCCTATGCTGACCACCACCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGGGCCCGC 1 320 
CCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGTGACCCTCCTGGGGCCTAGGGCC 1380 
TGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGGTG 1440 
TGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGCCAACCAACCAGCTGCTGACAGG 1 500 
GGACCTGGCCATTCTCAGGAACAAGAGAATGCAGGCAGGCAAATGGCATTACTGCCCCTG 1 560 
TCCTCCCCACCCTGTCATGTGTG ATTCCAGGC 1592 
(SEQ ID NO: 44) 



Protein sequence encoded by the coding sequence shown above. 

>CG50817-05 

MLLSSLVSUVGSWU^WILFFVLYDFCIVCITIYAINVSLMWLSFRKVQEPQGQPK 60 
NTVPGEWPWQASVRRQGAHICSGSLVADTVVVLTAAHCFEKAAATELNSCVRDSAPGAEEV 1 20 
GVAALQLPRAYNHYSQGSDLALLQLAHPTTHTPLCLPQPAHRFPFGASCWATGWDQDTSD 180 
APGTLRNLRLRLISRPTCNCIYNQLHQRHLSNPARPGMLCGGPQPGVQGPCQGDSGGPVL 240 
CLEPDGHWVQAGIISFASSCAQEDAPVLLTNTAAHSSWLQARVQGAAFLAQSPETPEMSD 300 
EDSCVACGSLRTAGPQAGAPSPWPWEARLMHQGQLACGGALVSEEAVLTAAHCFIGRC3AP 360 
EEWSVGLGTRPEEWGLKQLILHGA\THPEGGYDMALLLLAQPVTLGASLRPLCLPYADHH 420 
LPDGERGWVLGRARPGAGISSLQTVPVTLLGPRACSRLHAAPGGDGSPILPGMVCTSAVG 480 
ELPSCEANQPAADRGPGHSQEQENAGRQMALLPLSSPPCHV 521 

(SEQIDNO:45) 



Figure 18. Nucleotide and Protein Sequences for CG50817-06. 



Nucleotide sequence encoding the Peptidase-lilce protein of the invention. 

>CG50817-06 

AGCGACACCTGTCCAACCCGGCCCGGCCTGGGA TGCTATGTGGGGGCCCCCAGCCTGGGG 60 
TGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGAC 120 
ACTGGGTTCAGGGTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTG 1 80 
TGCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAG 240 
CTTTCCTGGCCCAGAGCCCAGAGACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCT 300 
GTGGATCCTTGAGGACAGCAGGTCCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGG 360 
CCAGGCTGATGCACCAGGGACAGGTGGCCTGTGGCGGAGCGCTGGTGTCAGAGGAGGCGG 420 
TGCTAACTGCTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGC 480 
TGGGGACCAGACCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCC 540 
ACCCTGAGGGGGGCTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAG 600 
CCAGCCTGCGGCCCCTCTGCCTGCCCTATGCTGACCACCACCTGCCTGATGGGGAGCGTG 660 
GCTGGGTTCTGGGACGGGCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCG 720 
TGACCCTCCTGGGGCCTAGGGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCA 780 
GCCCTATTCTGGCGGGGATGGTGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGG 840 
CCAACCAACCAGCTGCTGACAGGGGACCTGGCCATTCTCAGGAACAAGAGAATGCAGGCA 900 
GGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTG ATTCCAGGCACCAG 960 
GGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGGTGCCCAC 1020 
TCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAG 1080 
CAGGCGTCTCAGCTTTCCTCCTCCTTTACCCTTTCAGATACAATCACGCCAGCCACGTTG 1140 
TTTTGAAAATTTC I 1 1 1 1 i IGGGGGGCAGCAGTTTTCCTTTTTTTAAACTTAAATAAATT 1200 
(SEQ ID NO:46) 



Protein sequence encoded by the coding sequence shown above. 

>CG50817-06 

MLCGGPQPGVQGPCQGDSGGPVLCLEPDGHWVQAGIISFASSCAQEDAPVLLTNTAAHSS 60 
WLQARVQGAAFUVQSPETPEMSDEDSCVACGSLRTAGPQAGAPSPWPWEARLMHQGQLAC 120 
GGALVSEEAVLTAAHCFIGRQAPEEWSVGLGTRPEEWGLKQLILHGAYTHPEGGYDMALL 1 80 
LLAQPVTLGASLRPLCLPYADHHLPDGERGWVLGRARPGAGISSLQTVPVTLLGPRACSR 240 
LHAAPGGDGSPILPGMVCTSAVGELPSCEANQPAADRGPGHSQEQENAGRQMALLPLSSP 300 
PCHV 304 
(SEQ ID NO: 47) 



Figure 19. Nucleotide and Protein Sequences For CG51099-03. 



Nucleotide sequence encoding the Serine Protease-like protein of the invention. 



>CG51099-03 

CGGAGAGACGCAGTCGGCTGCCACCCCGGGA TGGGTCGCTGGTGCCAGACCGTCGCGCGC 6 0 

GGGCAGCGCCCCCGGACGTCTGCCCCCTCCCGCGCCGGTGCCCTGCTGCTGCTGCTTCTG 120 

TTGCTGAGGTCTGCAGGTTGCTGGGGCGCAGGGGAAGCCCCGGGGGCGCTGTCCACTGCT 180 

GATCCCGCCGACCAGAGCGTCCAGTGTGTCCCCAAGGCCACCTGTCCTTCCAGCCGGCCT 24 0 

CGCCTTCTCTGGCAGACCCCGACCACCCAGACACTGCCCTCGACCACCAtGGAGACCCAA 3 00 

TTCCCAGTTTCnXSAAGGCyVAAGTCGACCCATACCGCTCCTGTGGCTTTTCC^^ 360 

GACCCCACCCTCAGGGACCCAGAAGCCGTGGCTCGGCGGTGGCCCTGGATGGTCAGCGTG 420 

CGGGCCAATGGCACACACATCTGTGCCGGCACCATCATTGCCTCCCAGTGGGTGC^^ 4 80 

GTGGCCCACTGCCTGATCTGGCGTGATGTTATCTACTCAGTGAGGGTGGGGAGTCCGTGG 540 

ATTGACCAGATGACGCAGACCGCCTCCGATGTCCCGGTGCTCCAGGTCATCATGCATAGC 600 

AGGTACCGGGCCCAGCGGTTCTGGTCCTGGGTGGGCCAGGCCAACGACATCGGCCTCCTC 660 

AAGCTCAAGCAGGAACTCAAGTACAGCAATTACGTGCGGCCCATCTGCCTGCCTGGCACG 720 

GACTATGTGTTGAAGGACCATTCCCGCTGCACTGTGACGGGCTGGGGACTTTCCAAGGCT 780 

GACGGCATGTGGCCTCAGTTCCGGACCATTCAGGAGAAGGAAGTCATCATCCTGAACAAC 840 

AAAGAGTGTGACAATTTCTACCACAACTTCACCAAAATCCCCACTCTGGTTCAGATCATC 900 

AAGTCCCAGATG ATGTGTGCGGAGGACACCCACAGGGAGAAGTTCTGCTATGAGCTAACT 960 

GGAGAGCCCTTGGTCTGCTCCATGGAGGGCACGTGGTACCTGGTGGGATTGGTGAGCTGG 1020 

GGTGCAGGCTGCCAGAAGAGCGAGGCCCCACCCATCTACCTACAGGTCTCCTCCTACCT^ 1080 

CACTGGATCTGGGACTGCCTCAACGGGCAGGCCCTGGCCCTGCCAGCCCCATCCAGGACC 1140 

CTGCTCCTGGCACTCCCACTGCCCCTCAGCCTCCTTGCTGCCCTCTG ACTCTGTGTGCCC 1200 

TCCCTCACTTGTGA 1214 
(SEQ ID NO:48) 

Protein sequence encoded by the nucleotide sequence shown above. 



>CG51099-03 

MGRWCQTVARGQRPRTSAPSRAGALLLLLLLLRSAGCWGAGEAPGALSTADPADQSVQCV 60 
PKATCPSSRPRLLWQTPTTQTLPSTTMETQFPVSEGKVDPYRSCGFSYEQDPTLRDPEAV 120 
ARRWPWMVSVRANGTHICAGTIIASQVWLWAHCLIWRDVIYSVRVGSPWIDQMTQTASD 1 80 
VPVLQVIMHSRYRAQRI^SWVGQANDIGLLKLKQELKYSNYVRPICLPGTDYVLKDHSRC 240 
TWGWGLSKADGMWPQFRTIQEKEVIILNNKECDNFYHNFTKIPTLVQIIKSQMMCAEDT 300 
HREKFGYELTGEPLVCSMEGTWYLVGLVSWGAGCQKSEAPPIYLQVSSYQHWIWDCLNGQ 360 
ALALPAPSRTLLLALPLPLSLU\AL 385 (SEQ ID NO:49) 



Figure 20. Nucleotide and Protein Sequences For CG57051-04. 



Nucleotide sequence encoding the Angiopoietin-like protein, CG57051-04. 



>CG57051-04 

TGCGGATCCTCACACGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAACCAAGCGGGT 60 
CTTACCCCCGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGAGAG 120 
TCCCCGAATCCCCGCTCCCAGGCTACCTAAGAGGA TGAGCGGTGCTCGGACGGCCGGGGC 180 
AGCCCTGATGCTCTGCGGCGCCACCGCCGTGCTACTGAGCGCTAGATCTGGACCCGTGCA 240 
GTCCAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCCT 300 
GCAGCTCGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCT 360 
GGAGCGGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCT 420 
CCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCAGACACAACT 480 
CAAGGCTCAGAACAGCAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGCAGCAGCGGCA 540 
CCTGGAGAAGCAGCACCTGCGAATTCAGCATGTGCAAAGCCAGTTTGGCCTCCTGGACCA 600 
CAAGCACCTAGACCATGAGGTGGCCAAGCCTGCCCGAAGAAAGAGGCTGCCCGAGATGGC 660 
CCAGCCAGTTGACCCGGCTCACAATGTCAGCCGCGTGCACCGAGGCTGGTGGTTTGGCAC 720 
CTGCAGCCATTCCAACCTCAACGGCCAGTACTTCCGCTCCATCCCACAGCAGCGGCAGAA 780 
GGTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCTACTACCCGCTGCAGGCCAC 840 
CACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCTAG CGTCCTGGCTGGGCCTG 900 
GTCCCAGGCCCACGAAAGACGGTGACTCTTGGCTCTG 937 (SEQ ID NO:50) 



Protein sequence encoded by the nucleotide sequence shown above. 



>CG57051-04 

MSGAPTAGAALMLCAATAVLLSARSGPVQSKSPRFASWDEMNVLAHGLLQLGQGLREHAE 60 
RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 120 
HKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSR 180 
LHRGVWFGTCSHSNLNGQYFRSIPQQRQKLKKGIFWKmRGRYYPLQATTMLIQPMAAEA 240 
AS 242 (SEQ ID N0:51) 




igure 21. Nucleotide and Protein Sequences For CG57051-05. 

Nucleotide sequence encoding the Angiopoietin-like protein, CG57051-05. 



>CG57051-05 

CTTCGTCTCCAGTCCTCGCACCTGGAACCCCAACX3TCCCCGAGAGTCCCCGAATCCCCGC 60 

TCCCAGGCTACCTAAGAGGA TGAGCGGCGCTCCX3ACGGCCGGGGCAGCCCTGATGCTCTG 120 

CGCCGCCACCGCCGTGCTACTGAGCGCTCAGGGCGGACCCGTGCAGTCCAAGTCGCCGCG 180 

CTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCCTGCAGCTCGGCCAGGG 240 

GCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCX3CTGGAGCGGCGCCTGAG 300 

CGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCTCCCGTTAGCCCCTGA 360 

GAGCCX3GGTGGACCCTGAGGTCCrTa\CAGCCroCAGACAC^ 420 

CAGGATCCAGCAACrCTTCCACAAGGTGGCCCAGCAGCAGCGGCACCT^^ 4 80 

CCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGCCTCCTGGACCACyVAG 540 

TGAGGGTGGCAAGCCTGCCCGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCC 600 

GGCTCyVCAATGTCAGCCGCCTGCACCATGGAGGCTGGACAGTAATTCAGAGGCGCCACG^ 660 

TGGCTCAGTGGACTTCAACCGGCCCTGGGAAGCCTACAAGGCGGGGTTTGGGGATCCCCA 720 

CGGCGAGTTCTGGCTGGGTCTGGAGAAGGTGCATAGCATCATGGGGGACCGCAACAGCCG 780 
CCTGGCCGTGGAGCTGCGGGACTGGGATGGCAACXXrCXJAGTTGCTGC^ 84 0 

CCTGGGTGGCGAGGACACGGCCTATAGCCTGCAGCTCACTGCACCCGTGGCCGGCCAGCT 900 
GGGCGCCACCACCGTCCCACCa^GOSGCCTCTCaSTACCCTrCTCCACTT^^ 960 

TCACGACCTCCGCAGGGACAAGAACTGCGCCAAGAGCCTCTCTGGAGGCTGGTGGTTTGG 1020 

CACCTGCAGCCATTCCAACCTCAACGGCC7VGTACTTCCGCTCCATCCCACAG(^GCGG 1080 

GAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCTACTACCCGCTGCAGGC 114 0 

CACCACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCTA GCGTCCTGGCTGGGC 1200 
CTGGTCCCAGGCCCACGAAAGAGGTGACTCTTGGCTCTG 1239 (SEQ ID NO: 52) 

Protein sequence for Angiopoietin-like protein, CG57051-05. 



>CG57051-05 

MSGAPTAGAALMLCAATAVLLSAQGGPVQSKSPRFASWDEMNVIJ^GLLQLGW 6 0 

RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 120 

HKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEGGKPARRKRLPEMAQPVDPAHNVSR 180 

LHHGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWLGLEKVHSIMGDRNSRLAVQLR 240 

DWDGNAELLQFSVHLGGEDTAYSLQLTAPVAGQI/SATTVPPSGLSVPFSTWr^ 300 

KNCAKSLSGGWWFGTCSHSNLNGQYFRS I PQQRQKLKKGI FWKTWRGRYYPLQATTMLI Q 360 
PMA/^EAAS 368 (SEQ ID NO: 53) 



igure 22. Nucleotide and Protein Sequences For CG57051-02. 

Nucleotide sequence encoding the Angiopoietin-like protein of the invention. 



>CG57051_02 

TGCGGATCCTCACACGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAACCAAGCGGGT 60 

CTTACCCCCGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGAGAG 120 

TCCCCGAATCCCCGCTCCCAGGCTACCTAAGAGGA TGAGCGGTGCTCCGACGGCCGGGGC 180 

AGCCCTGATGCrCTGCGCCGCCACCGCCGTGCTACrGAGCGCTAGATCTGGACCCGTGCA 240 

GTCCAAGTax:CGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGC^ 300 

GCAGCTCGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCT 360 

GGAGCGGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCT 420 

CCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCAGACACAACT 4 80 

CAAGGCTCAGAACAGCAGGATCCAGCy^CTCTTCCyVCAAGGTGGCCCAGCAGC^ 540 

CCTGGAGAAGCAGCACCTGCGAATTOVGCATCTGCAAAGCa^GTTTGGCCT^ 600 

CAAGCACCTAGACCATGAGGTGGCCAAACCT6CCCGAAGAAAGAGGCTGCCCX3A6ATGGC 660 

CCAGCCTVGTTGACCCGGCTCACAATGTCAGCCGCCTGCACCATGGAGGCTGGAOVGTAA 720 

TCAGAGGCGCCACGATGGCTCAATGGACTTCAACCGGCCCTGGGAAGCCTACAAGGCGGG 780 

GTTTGGGGATCCCCACGGCGAGTTCTGGCTGGGTCTGGAGAAGGTGCATAGCATCACGGG 840 

GGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACTGGGATGGCAACGCCGAGTTGCT 900 

GCAGTTCTCCGTGCACCTGGGTGGCGAGGACACGGCCTATAGCCTGCAGCTCACTGCACC 960 

CGTGGCCGGCCAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGTACCCTT 1020 

»CITGGGACCAGGATCAC6ACCTCCGa^GGGACAAGJUV^^ 1080 

CCCATCGGTGGCTCAAAGACCTGACCATGTTCCCTCTCCCCTGACCCCGGCAGGAGGCTG 1140 

GTGGTTTGGCACCTGCAGCCATTCCAACCTCAACGGCCAGTACTTCCGCTCCATCCCACA 1200 

GCAGCGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCTACTACCC 1260 
GCTGCAGGCCACCACCATGTTGATCCAGCCCATGGO^GCAGAGGCAGCCrCC^^ 1315 
(SEQ ID NO: 54) 



Protein sequence for CG5705 1 -02. 

>CG57051_02 

MSGAPTAGAAIiMLCAATAVLLSARSGPVQSKSPRFASWDEMNVLAHGLLQLGQGLREHA^ 6 0 

RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 120 

HKVAQQQRHLEKQHLRI QHLQSQFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSR 180 

IJfflGGWTVIQRRHDGSl^FNRPWEAYKAGFGDPHGEFWLGLEKVHSITGDRNSRI^ 240 

DWIXSNAELLQFSVHIX^GEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQDHDIiRRD 300 

KNCAKSLSAPSVAQRPDHVPS PLTPAGGWWFGTCSHSNLNGQYFRS I PQQRQKLKKGI FW 360 
KTWRGRYYPLQATTMLIQPMAAEAAS 386 (SEQ ID NO: 55) 



ure 23. Nucleotide and Protein Sequences For CG57051-03. 

Nucleotide sequence encoding the Angiopoietin-like protein, CG57051-03. 



>CG57051-03 

CCCCGAGAGTCCCCGAATCCCCGCTCCCAGGCTACCTAAGAGGA TGAGCGGTGCTCCGAC 60 

GGCCGGGGCAGCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGA6CGCTCAGGGCGG 120 

ACCCGTGCAGTCCAAGTCGCCGCGCTTTGCGTCCIXXXSACGAGATGAATG^ 180 

CGGACTCCTGCAGCTCGGCaVGGGGCTGCGCGAACACGCGGAGCGCACCCGC^ 240 

GAGCGCX5CTGGAGCGGCGCCTGAGCGCGTGCGGGTCa3CCTGTCAGGGAACCGAGGGGTC 300 

CACCGACCTCCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCA 360 

GACACAACTCAAGGCTCAGAACAGCAGGATCCAGCAACTCTTCCAC^ 420 

GCAGCGGCACCTGGAGAAGCAGCACCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGCCT 480 

CCTGGACCACAAGCACCTAGACCATGAGGTGGCCAAGCCTGCCCGAAGAAAGAGGCTGCC 540 

CGAGATGGCCCAGCCAGTTGACCCG6CTCACAATGT«GCCXX:CTGCACCATGGA 600 

GACAGTAATTCAGAGGCGCCACGATGGCTCAGTGGACTTCAACCX3GCCCTGGGAAGCCTA 660 

CAAGGCGGGGTTTGGGGATCCCCACGGCGAGTTCTGGCTGGGTCTGGAGAAGGTCa 720 

CATCACGGGGGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACTGGGATGACAACGC 780 

CGAGTTGCTGCAGTTCTCCGTGCACCrGGGTGGCGAGGACACGGCCTATAGCCTGCAGCT 840 

CACTGCACCCGTGGCCGGCCAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGT 900 

ACCCTTCCCCACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGAACTGCGCC^ 960 

CCTCTCTGGAGGCTGGTGGTTTGGCACCTGCAGCCATTCCAACCTaVACGGCCAGTAC^ 1020 

CCGCTCCATCCCACAGCAGCGGaiGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGC^ 1080 

GGGCC6CTACTACCCGCTGCAGGCCACCACCATGTTGATCCAGCCCATGGCAGCAGAGGC 1140 
AGCCTCCTAG 1150 (SEQ ID NO: 56) 



Protein sequence for CG57051-03. 



>CG57051-03 

MSGAPTAGAAU^LCAATAVLLSAQGGPVQSKSPRFASWDEMNVLAHGLLQLGQGU^ 6 0 

RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLF 120 

HKVAQQQRHLEKQHLRIQHLQSQFGLIJDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSR 180 

LHHGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWU5LEKVHSITGDRNSRLAVQLR 240 

DWDDNAEI^FSVHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFPTWDQDHDLRRD 300 

KNCAKSLSGGWWF6TCSHSNLNGQYFRS I PQQRQKLKKGI FWKTWRGRYYPLQATTMLIQ 360 
PMAAEAAS 368 (SEQ ID NO: 57) 



